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"Clostridium difficile vaccine" 

Tntroduction 

5 The invention relates to vaccines to provide immunological protection against C. 

difficile infection. 

Background 

10 Clostridium difficile is a common nosocomial pathogen and a major cause of 

morbidity and mortality among hospitalised patients throughout the world [Kelly et 
al., 1994]. Outbreaks of C. difficile have necessitated ward and partial hospital 
closure. With the increasing elderly population and the changing demographics of 
the population, C. difficile is set to become a major problem in the 21st century. The 

15 spectrum of C. difficile diseases range from asymptomatic carriage to mild diarrhoea 

to fulminant pseudomembranous colitis. Host factors rather than bacterial factors 
appear to determine the response to C. difficile [Cheng et al., 1997; McFarland et al., 
1991; Shimetal.. 1998]. 

20 Reports indicate that hypogammaglobulinaemia in children appears to predispose to 

the development of disease due to C. difficile and that therapy with intravenously 
administered gamma globulin can be associated with the clinical resolution of 
chronic relapsing colitis due to C. difficile disease [Leung et al.. 1991; Pelmutter et 
al., 1985]. A study by Mulligan et al. [1993] found elevated levels of 

25 immunoglobulins reactive with C. difficile in asymptomatic carriers as opposed to 

symptomatic patients. Recently it has been shown that patients who became 
colonised with C. difficile who had relatively low levels of serum IgG antibody 
against toxin A had a much greater risk of developing C. difficile diarrhoea [Kyne et 
al., 2000]. 

30 

It is clear that any advance in the understanding of C. difficile disease and methods 
of preventing or treating C. difficile diarrhoea (CDD) and other related diseases will 
be of major therapeutic potential. 



35 
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Statements of Invention 

According to the invention there is provided a vaccine for the treatment or 
prophylaxis of C. difficile associated disease, the vaccine comprising a C. difficile 
5 gene or a C. difficile peptide/polypeptide or a derivative or fragment or mutant or 

variant thereof which is immunogenic in humans. 

The invention also provides a vaccine for the treatment or prophylaxis of C. difficile 
associated disease, the vaccine comprising a C. difficile gene or C. difficile 
10 peptide/polypeptide or a derivative or fragment or mutant or variant ±ereof to which 

immunoreactivity is detected in individuals who have recovered from C. difficile 
infection. 

Preferably the gene encodes a C. difficile surface layer protein, SlpA or variant or 
15 homologue thereof. 

Preferably the peptide/polypeptide is a C. difficile surface layer protein, SlpA or 
variant or homologue thereof. 

20 Most preferably the vaccine comprises a chimeric nucleic acid sequence. Preferably 

the chimeric nucleic acid sequence is derived from the 5' end of the gene, encoding 
the mature N-terminal moiety of SlpA from C. difficile. 

In one embodiment of the invention the vaccine comprises a chimeric 
25 peptide/polypeptide. Preferably the amino acid sequence of the chimeric 

peptide/polypeptide is derived from the mature N-terminal moiety of SlpA from C. 

difficile. 



Preferably the vaccine of the invention contains an amino acid sequence SEQ ID 
No.l or a derivative or fragment or mutant or variant thereof. 
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Preferably the vaccine contains an amino acid sequence SEQ ID No. 2 or a derivative 
or fragment or mutant or variant thereof. 

In one embodiment of the invention the vaccine contains a nucleotide sequence SEQ 
5 ID No.3 or a derivative or fragment or mutant or variant thereof; a nucleotide 

sequence SEQ ID No.4 or a derivative or fragment or mutant or variant thereof; a 
nucleotide sequence SEQ ID No.5 or a derivative or fragment or mutant or variant 
thereof; a nucleotide sequence SEQ ID No.6 or a derivative or fragment or mutant or 
variant thereof; a nucleotide sequence SEQ ID No.7 or a derivative or fragment or 
10 mutant or variant thereof; a nucleotide sequence SEQ ID No.8 or a derivative or 

fragment or mutant or variant thereof; a nucleotide sequence SEQ ID No.9 or a 
derivative or fragment or mutant or variant thereof or a nucleotide sequence SEQ ID 
No.lO or a derivative or fragment or mutant or variant thereof. 

15 Preferably the vaccine of the invention is in combination with at least one other C. 

difficile sub-unit. 

The invention provides a vaccine for the treatment or prophylaxis of C. difficile 
associated disease, the vaccine comprising the mature N-terminal moiety of a surface 
20 layer protein. SlpA of C. difficile or variant or homologue thereof which is 

immunogenic in humans. 

Most preferably the N-terminal moiety of SlpA contains an amino acid sequence 
SEQ ID No. 1. 

25 

In one embodiment of the invention the N-terminal moiety of SlpA contains an 
amino acid sequence SEQ ID No. 2. 

The invention also provides a vaccine for the treatment or prophylaxis of C. difficile 
30 associated disease, the vaccine comprising an immunodominant epitope derived 
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from a C. difficile gene or a C. difficile peptide/polypeptide or a derivative or 
fragment or mutant or variant thereof which is immunogenic in humans. 

Preferably the vaccine of the invention comprises a pharmaceutically acceptable 
5 carrier. Most preferably the vaccine is in combination with a pharmacologically 

suitable adjuvant. Ideally the adjuvant is interleukin 12. Altemativley the adjuvant 
may be a heat shock protein. 

In one embodiment of the invention the vaccine comprises at least one other 
10 pharmaceutical product. 

The pharmaceutical product may be an antibiotic, selected from one or more 
metronidazole, amoxycillin, tetracycline or erythromycin, clarithromycin or 
tinidazole. 

15 

In one embodiment of the invention the pharmaceutical product comprises an acid- 
suppressing agent such as omeprazole or bismuth salts. 

The vaccine of the invention may be in a form for oral administration, intranasal 
20 administration, intravenous administration or intramuscular administration. 

In one embodiment of the invention the vaccine includes a peptide delivery system. 



The invention also provides an immunodominant epitope derived from a C. difficile 
gene or a C. difficile peptide/polypeptide or a derivative or fragment or mutant or 
variant thereof. Preferably the C. difficile peptide/polypeptide contains an amino 
acid sequence SEQ ID No.l or SEQ ID No.2 or a derivative or fragment or mutant or 

variant thereof. 

In one embodiment of the invention the C. difficile peptide/polypeptide contains an 
amino acid sequence SEQ ID No.3 or SEQ ID No.4 or SEQ ID No.5 or SEQ ID 
No.6 or SEQ ID No.7 or SEQ ID No.8 or SEQ ID No. 9 or SEQ ID No. 10 or a 
derivative or fragment or mutant or variant thereof. 
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The invention further provides a chimeric nucleic acid sequence derived from the 5' 
end of the slpA gene encoding the mature N-terminal moiety of SlpA from C. 
difficile which is immunogenic in humans. 

5 The invention also provides a chimeric peptide/polypeptide wherein the amino acid 

sequence of the chimeric peptide/polypeptide is derived from the mature N-terminal 
moiety of SlpA from C. difficile. 

The invention provides a C. difficile peptide comprising SEQ ID No. 1 or SEQ ID 
10 No. 2 or SEQ ID No. 3 or SEQ ID No. 4 or SEQ ID No. 5 or SEQ ID No. 6 or SEQ 

ID No. 7 or SEQ ID No. 8 or SEQ ID No. 9 or SEQ ID No. 10. 

One aspect of the invention provides for the use of a C. difficile gene or a C. difficile 
peptide/polypeptide or a derivative or fragment or muteint or variant thereof which is 
15 immunogenic in humans in the preparation of a medicament for use in a method for 

the treatment or prophylaxis of C. difficile infection or C. difficile associated disease 
in a host. 

Preferably the medicament which is prepared is a vaccine of the invention. 

20 

The invention also provides a method for preparing a vaccine for prophylaxis or 
treatment of C. difficile associated disease, the method comprising; 

obtaining a C. difficile gene or a C. difficile peptide/polypeptide or a 
25 derivative or fragment or mutant or variant thereof which is immunogenic in 

humans; and 

forming a vaccine preparation comprised of said gene or peptide/polypeptide 
or derivative or fragment or mutant or variant, which is suitable for 
30 administration to a host and which when administered raises an immune 

response. 
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Preferably the C. difficile peptide/polypeptide contains an amino acid sequence SEQ 
ID No.l or SEQ ID No.2 or a derivative or fragment or mutant or variant thereof. 

Most preferably the C. difficile gene contains an amino acid sequence SEQ ID No.3 
5 or SEQ ID No.4 or SEQ ID No.5 or SEQ ID No.6 or SEQ ID No. 7 or SEQ ID No.8 

or SEQ ID No.9 or SEQ ID No. 10 or a derivative or fragment or mutant or variant 
thereof. 

The invention further provides a method for prophylaxis or treatment of C. difficile 
10 associated disease, the method comprising; 

obtaining a C. difficile gene or a C. difficile peptide/polypeptide or a 
derivative or fragment or mutant or variant thereof which is immunogenic in 
humans; 

15 

forming a vaccine preparation comprised of said gene or peptide/polypeptide 
or derivative or fragment or mutant or variant, and 

administering the vaccine preparation to a host to raise an immune response. 

20 

One aspect of the invention provides monoclonal or polyclonal antibodies or 
fragments thereof, to a C. difficile peptide/polypeptide or a derivative or fragment or 
mutant or variant thereof which is immunogenic in humans. 

25 Another aspect of the invention provides monoclonal or polyclonal antibodies or 

fragments thereof, to C. difficile peptide/polypeptide or a derivative or fragment or 
mutant or variant thereof to which immunoreactivity is detected in individuals who 
have recovered from C. difficile infection. 



30 



The invention also provides purified antibodies or serum obtained by immunisation 
of an animal with a vaccine of the invention. 



wo 02/062379 PCT/IE02/00017 



The invention provides the use of the antibodies or fragments of the invention in the 
preparation of a medicament for treatment or prophylaxis of C. difficile infection or 
C. difficile associated disease. 

5 Preferably the antibodies or serum are used in the preparation of a medicament for 

treatment or prophylaxis of C. difficile infection or C. difficile associated disease. 

Most preferably the antibodies or fragments or serum of the invention are used in 
passive immunotherapy for established C. difficile infection. 

10 

In one embodiment of the invention the antibodies or fragment or serum of the 
invention are used for the eradication of C. difficile associated disease. 

The invention also provides use of interleukin 12 as an adjuvant in C. difficile 
15 vaccine. 

The invention further provides use of humanised antibodies or serum for passive 
vaccination of an individual with C. difficile infection. 

20 

Brief Description of the Drawings 

The invention will be more clearly understood from the following description thereof 
given by way of example only with reference to the accompanying figures, in 
25 which: - 

Fig. lA is a Western blot showing recognition of antigens from a crude 
extract of C. difficile 171500 (PGR type 1) by serum antibodies from a 
patient infected with this strain. Lane 1: Pre-infection; Lane 2: Early acute; 
3Q Lane 3: Late acute; Lane 4: Convalescent; 
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Fig. IB is a Western blot showing recognition of antigens from a crude 
extract of C. difficile 170324 (PGR type 12) by serum antibodies from a 
patient infected with this strain. Lane 1: Pre-infection; Lanes 2-5: Acute; 
Lanes 6-7: Convalescent; 

5 

Fig. 2. is a Western blot showing recognition of antigens from two C. difficile 
strains of different type by serum from convalescent patients. 

Lane 1: Strain 170324 (PGR type 12), crude antigen preparation 

Lane 2: Strain 170324, surface layer protein preparation 
10 Lane 3: Strain 171500 (PGR type 1), crude antigen preparation 

Lane 4: Strain 171500, surface layer protein preparation. 

Molecular mass markers (kDa) are shown on the left; and 



Fig. 3 is an SDS-PAGE gel showing crude SLF preparations from selected 
strains of C. difficile. The gel contains 12% acrylamide, and has been stained 
for protein with Coomassie Blue. Each lane contains 5 fig of protein. 
Molecular weight markers are shown on the left. 

Lanel: 171500 (PGR type 1) 

Lane 2: 172450 (PGR type 5) 

Lane 3; 170324 (PGR type 12) 

Lane 4: 171448 (PGR type 12) 

Lane 5: 171862 (PGR type 17) 

Lane 6: 173644 (PGR type 31) 

Lane 7: 170444 (PGR type 46) 

Lane 8: 170426 (PGR type 92) 



Detailed Description of the invention 

30 Two antigenic peptides containing SEQ ID No. 1 and SEQ ID No. 2, associated with 

two common infecting types of C. difficile, were found to be immunogenic in 
humans. The antigenic peptides were found to induce a strong immune response in 
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individuals who recover from C. difficile infection. Individuals who have recovered 
from C. difficile infection are those individuals who have been exposed to C. difficile 
or something strongly related and have recovered. This includes individuals where a 
carrier state exists in that the C. difficile infection has not and will not necessarily 
5 become clinically significant. 

These antigenic peptides were found to be products of the slpA gene from C. difficile 
which is the structural gene for the surface layer protein, SlpA The gene or its 
products are therefore ideal candidates for the preparation of vaccines against C. 
10 difficUe. 

Surface layer proteins (SLPs), also known as S-layers or crystalline surface layers, 
are associated with a wide range of bacterial species. They form a 2-dimensional 
array, which covers the surface of the cell completely, and grows with the cell 

15 [Sleytr et al., 1993]. The molecular weight can range from 40 000 to 200 000 Da. 

The proteins are typically acidic, contain a large proportion of hydrophobic amino 
acid residues, and have few or no sulphur-containing amino acid residues. 
Glycosylated S-layer proteins occur in some species. The precise function of S- 
layers is not always known, but since they comprise approximately 15% of the cell 

20 protein, it seems likely that they are important for in vivo functioning of the 

organism. In Gram positive organisms, the SLP has been shown to delay or prevent 
the excretion of degradative enzymes from the cell to the outside milieu, and may 
thereby create a space analagous to the periplasmic space of Gram negative bacteria. 
Many pathogenic species possess SLPs, which have been ascribed functions such as 

25 antiphagocytosis (Campylobacter fetus), and inhibition of complement-mediated 

killing (Aeromonas salmonicida). 

Kawata et al. [1984] described the SLPs of Clostridium difficile. They showed the 
S-layer to be composed of 2 polypeptides, and demonstrated size heterogeneity for 
30 the polypeptides from different strains. Delmee et al. [1986] showed that crude 

extracts from C. difficile strains of different serotype showed different polypeptide 
profiles in SDS-PAGE. Poxton et al. [1999] made similar observations using 
purified SLP preparations. Slide agglutination [Delmee et al., 1990] has identified 21 
different serotypes, apparently distinguished by the heterogeneity of the SLP. 



35 
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Pantosti et al. [1989] isolated C. difficile from a number of patients with jintibiotic- 
associated diarrhoea, and prepared SLPs from them.. Cerquetti et al. [2000] 
published N-terminal sequences of SLPs from several strains, indicating wide 
differences between strains.. In 2000 the complete DNA sequence of the C. difficile 
5 genome was published (available at web address 

http://www.sanger.ac.Uk/Proiects/C difficile/) . 

The peptides of the invention were found to be encoded by a single open reading 
frame (ORF) named slpA from C. difficile. The peptides identified in our clinical 
10 study correspond to a lower molecular weight moiety of the slpA gene product. 

Since an immune response is also mounted against a higher molecular weight slpA 
gene product (Fig. 2), this entity may also be included in a vaccine. 

The slpA gene has been sequenced from a number of strains corresponding to 

15 different PGR types. The sequences of strains 171500 (PGR type 1)(NCIMB 41081; 

PHLS R13537), 172450 (PGR type 5)(PHLS R12884), 170324 (PGR type 12) 
(NGIMB 41080; PHLS R12882),, 171448 (PGR type 12) (PHLS R13550). 171862 
(PGR type 17) (PHLS R13702), 173644 (PGR type 31) (PHLS R13711), 170444 
(PGR type 46) (PHLS R12883) and 170426 (PGR type 92) (PHLS R12871) with 

20 translations thereof are given in Appendices 1 to 8. Substantial variation in 

nucleotide and predicted amino acid sequence was found between strains of PGR 
types 1, 5, 12, 17 and 31. The genes from strains of PGR types 46 and 92 are almost 
identical in sequence to those of PGR type 12. When the DNA sequences of genes 
of different strains within a PGR type are compared, the sequences are almost if not 

25 quite identical, indicating that the potential for variation is not infinite. These 

findings are in agreement with serotyping studies [Delmee et al., 1986, 1990], and 
indicate that the production of an effective vaccine based on the slpA product is 
feasible. In this respect, the present invention includes all variant slpA genes and 
their products, individually and combined, fragments of them, and their mutants and 

30 derivatives. 

One aspect of the invention provides the combination of immunodominant eptopes 
from the slpA gene products from various serotypes into a single vaccine. In this 
way a single vaccine may be used to immunise against several different C. difficile 
35 strains. 
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The most common PGR types isolated from infections in the clinical study carried 
out at St. James's Hospital, Dublin, Ireland were PGR types 1 and 12. However, a 
vaccine which elicits an intense antibody response against many infecting types 
would be therapeutically very valuable. Recombinant DNA chimera, or several 
5 chimeras, encoding contiguous immunodominant epitopes may be made for use in 

the vaccine. The recombinant DNA may serve as the active component in a vaccine, 
or may be inserted into an appropriate expression system for the generation of a 
chimeric peptide vaccine in a suitable host. 

10 Chimeras can be generated by PGR amplification of the DNA encoding peptide 

regions of interest, incorporating cleavage sites for restriction endonucleases into the 
primers. The amplified fragments can thus be cleaved to generate compatible ends, 
and spliced together to create chimeras. 

15 The dominant epitopes may be identified by cleavage of the slpA products into 

fragments by agents which cleave at known sites, and by immunoblotting with 
homologous patient serum. Immunodominant peptides may be tested for their 
capacity to stimulate T-cell proliferative responses in vitro, using mouse splenic T- 
cells. 

20 

DNA vaccination involves immunisation with recombinant DNA encoding the 
antigen or epitope of interest, cloned in a vector which promotes high level 
expression in mammalian cells. Typically, the vector is a plasmid vector which 
which also replicates in a procaryotic vector such as Escherichia coli, so that the 
25 DNA can be produced in quantity. Following immunisation, the plasmid enters a 

host cell, where it remains in the nucleus, and directs synthesis of the recombinant 
polypeptide. The polypeptide stimulates the production of neutralising antibodies, as 
well as activating cytotoxic T-cells. 

30 Using a DNA vaccine, it may be necessary to modify the DNA sequence to take 

account of codon usage in humans. The G+G content of mammalian DNA is much 
higher than that of C. difficile. The generation of such synthetic DNA molecules, 
essentially containing numerous silent mutations, is within the scope of the 
invention. 



35 
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A peptide vaccine will ideally be made using recombinant peptides. Similar 
considerations apply as in the generation of a DNA vaccine with regard to expression 
in a different host, such as Escherichia coli, which has a different codon usage 
pattern to C. difficile. Problems of expression may be overcome by the use of a 
5 special host strain which carries additional copies of rare tRNAs (e.g. E. coli BL21- 

CodonPlus™-RIL from Stratagene), or by using de novo synthesis of a DNA 
segment carrying silent mutations which will enable normal expression in E. coli. 
There are many expression systems which are likely to allow high-level expression 
of slpA genes in E. coli. An example is the pB AD/Thio TOPO vector of Invitrogen, 

10 in which expressed genes are under control of the arabinose promoter, which is 

subject to positive and negative control, enabling very tight control of expression. In 
this vector, the recombinant protein is typically fused to a modified thioredoxin 
carrying several histidine residues which enable purification by nickel 
chromatography. The recombinant protein can be cleaved from the thioredoxin 

15 moiety by enterokinase enzyme. 

Affinity chromatography may also be used with fixed antibodies or some other agent 
which strongly binds the peptide of interest to purify the protein from the native 
organism. 

20 

Purified immunogenic peptides may be used in combination with other C. difficile 
sub-units as a combined vaccine against C. difficile. Potential candidates are the 
products of the other sip genes, which share limited homology with the slpA gene 
product and with the N-acetylmuramoyl L-alanine amidase, (CwlB), from Bacillus 
25 subtilis, and which may be involved in remodelling of the peptidoglycan. 

Oother purified proteins of C. difficile to which constitutive antibodies are detected 
in individuals recovering from C. difficile infection are also within the scope of the 
present invention 

30 

A deposit of Clostridium difficile strain 171500, PGR type 1, was made at the 
NCIMB on January 29, 2001, and accorded the accession number NCIMB 41081. 

A deposit of Clostridium difficile strain 170324, PGR type 12, was made at the 
35 NGIMB on January 29, 2001, and accorded the accession number NGIMB 41080. 
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Two peptides of the invention were found to contain the following sequences: 
33kDa peptide 

SEQ ID No. 1 : DKTKVETADQGYTVVQSKYK 

5 

31kDa peptide 

SEQ ID No. 2 ATTGTQGYTWKNDGKKAVK 

The invention will be more clearly understood from the following examples. 

10 

Example 1. Clinical Study 

Examination of sequential antibody responses to C. difficile among elderly patients 
who developed the disease was carried out. The study was based on the hypothesis 
15 that the host immune response influenced the development of Clostridium difficile 

disease. In particular we determined that a particular pattern of immune response to 
C. difficile antigens correlated with the outcome of CDD. 

Materials and Methods 

20 

Patients 

Serum was collected from over 300 patients and of these 30 patients developed 
CDD. The infecting strain (homologous strain) was grown from each patient. 
Strains of C. difficile were typed at ttie Anaerobe Reference Laboratory, Wales 

25 [O'Neill et al., 1996]. The most common strains isolated were PGR type 1 (n = 15) 

which is the most common type causing epidemics and PCR type 12 (n = 5) which is 
also a common hospital strain. Pre-infection serum samples were obtained from 
patients. Acute phase sera were then collected from patients who developed C. 
difficile disease. Convalescent sera were collected from patients who recovered. 

30 Protein extracts of patients' infecting C. difficile strain were probed with the patients 

sera using Western blotting. IgG responses to the antigens were examined. 

Western blotting 

Proteins from SDS-PAGE gels were electroblotted (0.8mA/cm2 for 1 h) to PVDF 
35 membrane using a semi-dry blotting apparatus (Atto). Primary antibodies (human 
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serum: 1/50 - 1/10,000 dilution) were detected using a 1/5000 dilution of anti-human 
IgG (horse radish peroxidase-conjugated) in combination with enhanced 
chemiluminesence (ECL). Blots were washed in phosphate buffered saline (pH 7.5) 
containing Tween 20 (0.1% v/v), and incubated in the same solution comprising 
5 dried skim milk (5% w/v) and antibodies at the appropriate concentration. Blots 

were exposed to Kodak X-OMAT film for various periods of time and developed. 

Results 

Overall 5 patients made a full recovery and new antibody responses to previously 
10 unrecognised antigens were evident in 4 of these patients. Three of these patients 

had C. difficile belonging to PCR type 1 and one patient had C. difficile PGR type 12. 

These patients developed an acute phase antibody response to previously 

unrecognised C. difficile antigens which persisted during convalescence (Figs. lA 

and IB). These antigens were recognised by antibodies from patients who recovered 
15 and represent potential candidate vaccine antigens. Fig lA shows a strong reaction 

of convalescent antibodies was observed with the 33 kDa antigen (Lane 4, arrow). 

Fig IB shows a strong reaction of convalescent antibodies was observed with the 31 

kDa antigen (Lanes 6 and 7, arrow). 

20 These antibody responses have also been found in some controls in the same ward 

who were also on antibiotics but who did not develop CDD. 

Example 2. Further characterisation of protective antigens 

25 Materials and Methods 

Partial purification and N-terminal sequencing of the 33 kDa and the 31 kDa proteins 

The antigens were partially purified from C. difficile based on their molecular weight 
using preparative continuous-elution SDS-PAGE on a model 491 Prep-Cell (Bio- 
Rad). The appropriate antigens were subsequently identified on Western blots 
30 probed with serum obtained from individuals who recovered from C. difficile 

infection. 

Preparation of surface layer proteins (SLPs) 

SLPs were purified from C. difficile by extracting washed cells with 8 M urea, in 50 
35 mM Tris HCl, pH 8.3 in the presence of a cocktail of protease inhibitors 
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(Complete®, Boehringer Mannheim), for 1 h at 37°C, followed by centrifugation for 
19 000 X g for 30 min. The SLPs were recovered in the supematJint and dialysed to 
remove the urea [Cerquetti at al., 2000]. 

5 Results 

The immunodominant protein which was associated with a positive outcome from C. 
difficile strain 171500 (PGR type 1) was identified and purified using preparative 
SDS-PAGE. The N-terminal region of the protein was sequenced using an Applied 
Biosystems Procise Sequencer, viz DKTKVETADQGYTVVQSKYK (SEQ ID 
10 No. 1) 

The antigen which was associated with a protective antibody response from the C, 
difficile strain YJQZTA (PGR type 12) was identified and the N-terminal sequence 
obtained, viz ATTGTQGYTWKNDGKKAVK (SEQ ID No. 2). 

15 

These sequences were used to interrogate the C. diffcile genome sequence using the 
TBLASTN programme, which compared our query sequences with those of the 
genome project (available at web address 

http : //www. san ger. ac. uk/Proi ects/C difficile/) . translated in all 6 possible reading 

20 frames. A nearly identical stretch of sequence was identified when the sequence 

from strain 1710324 (type 12) was used for interrogation. The same stretch of 
sequence was picked up with the sequence from strain 171500 (type 1) was used, 
although the identity was much less strong. Since the homologous sequence 
belonged to an open reading frame encoding a 719-residue peptide, this result was 

25 somewhat surprising. However, when the N-terminal sequences from the higher 

molecular weight SLP component were later published by Cerquetti et al [2000], it 
became apparent that they were encoded downstream along the same gene, 
subsequently identified as slpA, and the reason for the discrepancy in size between 
the gene and its products became readily apparent. 

30 

The purified SLPs from strains 171500 (PGR type 1) and 170324 (PGR type 12) 
showed strong reactivity with homologous convalescent serum, and co-migrated 
with the dominant antigens detected in crude cell extracts as shown in Fig. 2. Lanes 
1 and 3 contain crude antigen preparations from PGR types 1 and 12 respectively, 
35 and Lanes 2 and 4 contain SLP preparations from PGR types 1 and 12, respectively. 
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Panel A was probed with serum from a patient recovering from infection with PGR 
type 1, and Panel B was probed with serum from a patient recovering from infection 
with PGR type 12. Each serum detected 2 major antigens in the infecting strain 
(Panel A, Lane 3); (Panel B, Lane 1), which co-migrated with the 2 SLPs (Panel A, 
5 Lane 4; Panel B, Lane 2), with which the sera also reacted strongly. Note that serum 

from the patient infected with the PGR type 1 strain recognised the higher molecular 
weight SLP from the PGR type 12 strain (Panel A, Lanes 1 and 2), whereas the 
converse did not occur (Panel B, Lanes 3 and 4). There is no apparent antigenic 
cross-reactivity with regard to the lower molecular weight SLPs. 

10 

SLPs were prepared from selected strains by urea extraction, and subjected to SDS- 
PAGE and staining with Goomassie Blue (Fig. 3). Most strains showed a 
characteristic profile, with two major bands located in the 29 000 to 36 000 and 45 
000 to 50 000 molecular weight range. An exception was strain 172450 (Fig. 3, 
15 Lane 2), which showed a single, high molecular weight band, approximately 43 000 

in size. 

Cloning, sequencing and analysis of slpA genes 

The nucleotide sequences of the slpA genes from the two sample strains of C. 

20 difficile (PGR types 1 and 12, deposited at the NGIMB) and of several others (PGR 

types 5, 12, 17, 31, 46 and 92, available from the Anaerobe Reference Unit at the 
Department of Medical Microbiology and Public Health Laboratory, Gardiff, Wales 
were obtained. The slpA gene and flanking sequence was amplified by polymerase 
chain reaction from genomic DNA prepared from C. difficile using a commercial kit 

25 (Puregene® DNA isolation kit for yeast and Gram positive bacteria, Gentra systems 

Minneapolis, MN). The forward primer (5' ATGGATTATTATAGAGATGTGAG 
3'), was based on sequence from the genome sequencing project, starting 112 
nucleotides upstream from the start of the slpA open reading frame. Two reverse 
primers were used, depending on the PGR type. A downstream primer (5' 

30 GTATTTAAAGTTTTATTAAAACTTATATTAC 3') was used to amplify slpA 

from PGR types 12, 17, 31, 46 and 92. A reverse primer based on the 3' end of the 
slpA open reading frame from strain 630 and the subsequent nonsense codon (5' 
TTAGATATGTAATAAATGTTTGATTTTGTTTATAACTG 3') was used to 



wo 02/062379 



17 



PCT/IE02/00017 



amplify slpA from PGR types 1 and 5. The choice of primer for the latter two PGR 
types may have resulted in a small number of systematic errors in the nucleotide 
sequence obtained. PGR was carried out using HotStar"^"^ Taq polymerase (Qiagen 
Ltd., Crawley, West Sussex, UK) according to the manufacturer's instructions. A 
5 single fragment of approximately 2 kb was obtained for each strain, which was then 

cloned into the pBAD/Thio TOPO vector (Invitrogen, Groningen, Netherlands). 
Inserts were sequenced from both ends by standard procedures in commercial 
facilities at MWG (Wolverton Mill South, Milton Keynes, UK) and Cambridge 
University. New primers were designed on the basis of initial sequencing results, 
10 enabling sequencing of both strands to be completed (a process known as 

chromosome walking). 

The results are shown in Appendices 1-8. 

15 The nucleotide sequences were translated to enable prediction of the amino acid 

sequence(s) of the product(s) (Appendices 1-8). The N-terminal sequences obtained 
experimentally for the low molecular weight protective antigens from strains 171500 
(PGR type 1) and 170324 (PGR type 12) were almost identical to those predicted 
from the nucleotide sequences of their respective slpA genes (18/20 identical 

20 residues for strain 171500, and 19/20 identical residues for strain 170324). 

Appendix 1 shows the open reading frame with translation for slpA from strain 
171500 (PGR type 1), SEQ ID No 3. Since the reverse primer was based on the 35 
nucleotides from the 3' end of the slpA gene, the sequence is not necessarily 100% 
25 accurate in this region. However, this part of the gene does not seem to vary greatly 

from strain to strain. 

Appendix 2 shows the open reading frame with translation for slpA from strain 
172450 (PGR type 5), SEQ ID No 4. Again, the sequence obtained for the 3' 35 
30 nucleotides is not fully reliable. This gene is considerably smaller than the other 

slpA genes sequenced, and shows strong sequence divergence from the other PGR 
types examined. 

Appendix 3 shows the open reading frame with translation for slpA from strain 
35 170324 (PGR type 12) , SEQ ID No 5. This gene showed a single base difference 
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when compared with the strain used for the genome sequencing project, strain 630, 
of the same PGR type. The deduced amino acid sequence is identical. 

Appendix 4 shows the open reading frame with translation for slpA from strain 
5 171448 (PGR type 12), SEQ ID No 6. This gene was almost identical in sequence to 

that from strain 170324. 

Appendix 5 shows the open reading frame with translation for slpA from strain 
171862 (PGR type 17), SEQ ID No 7. 

10 

Appendix 6 shows the open reading frame with translation for slpA from strain 
173644 (PGR type 31), SEQ ID No 8. Like the slpA from strain IITASQ, this 
sequence is very dissimilar to those of slpA genes from other PGR types 
encountered. 

15 

Appendix 7 shows the open reading frame with translation for slpA from strain 
170444 (PGR type 46), SEQ ID No 9. This sequence is virtually identical to that 
obtained for slpA from PGR type 12 and 92 strains. 

20 Appendix 8 shows the open reading frame with translation for slpA from strain 

170426 (PGR type 92), SEQ ID No 10. This sequence is virtually identical to that 
obtained for slpA from PGR type 1 2 and 46. 

The cleavage site of the putative signal sequences from both genes was determined 
25 from experimental evidence (the N-terminal sequence of the mature proteins as 

determined by Edman degradation), and by the prediction tool of the Centre for 
Biological Sequence Analysis at the Technical University of Denmark [Nielsen et 
al., 1997]. The site for cleavage of the slpA gene product to form the mature SLPs 
was predicted from experimental [Gerquetti et al., 2000, Karjalainen et al., 2001 and 
30 Galabi et al., 2001]. The cleavage site is typically preceded by the motif TKS. 

However, the relevant motif is likely to be TKG in strain 173644 (PGR type 31). No 
obvious motif appeared for strain 172450 (PGR type 5). However, the protein 
produced by type 5 strains does appear to be cleaved; hence we predicted the site to 
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occur at a point where the SLP sequence aligns with the cleavage sites of other PGR 
types. 

The molecular weight and isoelectric point was calculated for each of the predicted 
5 mature proteins by the ExPASy server of the Swiss Institute for Bioinformatics 

(Table 1). In general, the calculated molecular weights were in fair agreement with 
apparent molecular masses determined from migration in gels (Fig. 3). No lower 
molecular weight band was apparent for Strain 172450 (PGR type 5; Lane 2). 
However, a higher molecular weight band is present, which is sunilar m size to the 
10 predicted weight for the C-terminal moiety. We observed a similar profile for 

another type 5 strain. It is possible that the lower molecular weight species is subject 
to degradation in this strain. Another possibility is that it is heavily glycosylated, 
which can affect staining. All peptides had a predicted isoelectric point below 7, 
typical of acidic proteins, and characteristic of SLPs in general [Sleyter et al, 1993]. 

15 

Table 1 



C. difficile strain (PCK type) 


Pl 

(N-terminal) 


Pl 

(C-terminal) 


MW 
(N-terminal) 


MW 
(C-terminal) 


171500 (Type 1) 


4.83 


4.66 


33365.41 


44220.37 


172450 (Type 5) 


4.86 


4.65 


19364.46 


42757.63 


170324 (Type 12) 


4.92 


4.58 


34228.25 


39522.24 


171448 (Type 12) 


4.98 


4.58 


34156.18 


39492.21 


171862 (Type 17) 


5.09 


4.53 


33783.73 


39407.11 


173644 (Type 31) 


5.05 


4.56 


33626.48 


41821.69 


170444 (Type 46) 


5.06 


4.58 


34230.31 


39522.24 


170426 (Type 92) 


4.99 


4.58 


34242.32 


39522.24 



The translated nucleotide sequences were compared with published SlpA sequences 
(EMBL Accession numbers AJ300676, and AJ300677 for examples from PGR types 
1, and 17 respectively; strain 630 available from the Sanger Institute for PGR type 
12; EMBL Accession number AY004256 for a variant from an unnamed PGR type). 
The Clustal W alignment programme, which is freely available, was used. Where 
SlpA sequences from our isolates were compared with those of other strains of the 
same PGR types, they were found to be nearly or quite identical. This observation 



20 



25 
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indicates, together with existing knowledge from serotyping, that the number of 
variants of slpA is not infinite, and that natural evolution of the gene is not rapid. 
Table 2 shows a compilation of homologies, based on amino acid residue identity, 
for the different translated sequences measured against published sequences. 
5 Homologies are compiled for the predicted mature peptides, either combined (Table 

2A) or as N-terminal (low molecular weight, less conserved moiety) (Table 2B) and 
C-terminal (high molecular weight, more conserved) (Table 2C) mature peptides 
according to predicted cleavage sites. It is clear that the SlpA sequences from strains 
172450 (PGR type 5) and 173644 (PGR type 31) are quite distinct particularly with 
10 respect to N-terminal region. 

Table 2A 



Strain.type 


630 
(type 12) 


AJ300676 
(type 1) 


AJ300677 
(type 17) 


AY004256 
(type unknown) 


171500.typel 


55.2 


99.7 


55.4 


56.42 


172450.type5 


49.8 


54.0 


49.9 


47.77 


170324.typel2 


100.0 


57.8 


81.7 


59.77 


171448.typel2 


99.7 








171862.typel7 


82.3 


58.7 


100 


57.54 


173644.type31 


57.9 


59.2 


60.1 


56.88 


170444.type46 


99.6 








170426.type92 


99.9 









Table 2B 



Strain.type 


630 
(type 12) 


AJ300676 
(type 1) 


AJ300677 
(type 17) 


AY004256 
(type unknown) 


171500.typel 


35.4 


100 


34.5 


33.54 


172450.type5 


31.6 


32.2 


31.0 


24.58 


170324.typel2 


ICQ 


34.9 


64.6 


36.14 


171448.typel2 


99.7 








171862.typel7 


64.3 


34.4 


100 


31.55 


173644.type31 


37.5 


34.1 


41.3 


31.86 


170444.type46 


99.1 








170426.type92 


99.7 
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Table 2C 



Strain. typs 


630 
(type 12) 


AJ300676 
(type 1) 


AJ300677 
(type 17) 


AY004256 
(type unknown) 


1715O0.typel 


70.2 


99.5 


71.2 


73.80 


172450.type5 


58.4 


60.4 


63.0 


57.60 


170324.typel2 


100 


77.3 


97.1 


80.00 


171448.typel2 


99.7 








171862.typel7 


97.3 


78.8 


100 


79.62 


173644.type31 


74.1 


78.9 


75.1 


75.38 


170444.type46 


100 








170426.type92 


100 









The term antibody used throughout the specification includes but is not limited to 
polyclonal, monoclonal, chimeric, single chain. Fab fragments and fragments 
produced by a Fab expression library. 

10 The antibodies and fragments thereof may be humanised antibodies. Neutralising 

antibodies such as those which inhibit biological activity of the substance amino acid 
sequence are especially preferred for diagnostics and therapeutics. 

Antibodies both polyclonal and monoclonal which are directed against epitopes 
15 obtainable from a polypeptide or peptide of the present invention are particularly 

useful in diagnosis and those which are neutralising are useful in passive 
immunotherapy. 



Antibodies may be produced by any of the standard techniques well known in the art. 

20 

A therapeutically effective amount of the polypeptide, polynucleotide, peptide or 
antibody of the inventiion in the form of pharmaceutical composition may be 
adminsistered. The composition may optionally comprise a pharmaceutically 
acceptable carrier, diluent or excipients and including combinations thereof. The 
25 pharmaceutical composition may be used in conjugation with one or more additional 

pharmaceutically active compounds and/or adjuvants. 
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Different adjuvants depending on the host may be used to increase immunological 
response. The adjuvant may be selected from the group comprising Freunds, mineral 
gels such as aluminium hydroxide and surface active substances. 

5 The vaccine of the invention may be in the form of an immune modulating 

composition or pharmaceutical composition and may be administered by a number of 
different routes such as by injection (which includes parenteral, subcutaneous and 
intramuscular injection) intranasal, intramuscular, mucosal, oral, intra-vaginal, 
urethral or ocular administration. There may be different formulation/composition 
1 0 requirements dependent on the different delivery systems. 



The invention is not limited to the embodiments hereinbefore described which may 
15 be varied in detail. 
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Appendix 1 

SEQ ID No. 3. Nucleotide sequence of slpA from Clostridium 
difficile strain 171500, PGR type 1, with translation. The putative 
secretory signal cleavage site (A) and site of cleavage to form the 
two mature SLPs (♦) are indicated. 

1 ATGT^TAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCA 60 
IMNKKNIAIAMSGLTVLASAA 

20 

61 

CCTGTATTTGCAGATGATACAAAAGTTGAAACTGGTGATCAAGGATATACAGTGGTACAA 120 
+ 

21 PVFADDTKVETGDQGYTVVQ 

40 

A 

121 

AGCAAGTATAAGAAAGCTGTTGAACAATTACAAAAAGGAATATTAGATGGAAGTATAACA 180 

41SKyKKAVEQLQKGILDGSIT 

60 

181 

GAAATTAAAGTTTTCTTTGAGGGAACTTTAGCATCTACTATAAAAGTAGGTTCTGAGCTT 24 0 
61EIKVFFEGTLASTIKVGSEL 

80 

241 

AATGCAGCAGATGCAAGTAAATTATTGTTTACACAAGTAGATAATAAACTAGATAATTTA ' 300 

+ + + + + 

81NAADASKLLFTQVDNKLDNL 

100 

301 

GGTGATGGAGATTATGTAGATTTCTTAATAACTTCTCCAGGTCAAGGGGATAAAATAACT 3 60 
101 GDGDYVDFLITSPGQGDKIT 

120 

361 

ACAAGTAAACTTGTTGCATTGAAAGATTTAACAGGTGCTTCAGCAGATGCTATAATTGCT 420 

121TSKLVALKDLTGASADAIIA 

140 

421 

GGAACATCTTCAGCAGATGGTGTTGTTACAAATACTGGAGCTGCTAGTGGTTCTACTGAG 4 80 
141 GTSSADGVVTNTGAASGSTE 

160 
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481 

ACAAATTCAGCAGGAACAAAACTTGCAATGTCAGCTATTTTTGACACAGCATATACAGAT 540 
161 TNSAGTKLAMSAIFDTAYTD 

180 

541 

TCATCTGAAACTGCGGTTAAGATTACTATATW^GCAGATATGAATGATACTAAATTTGGT 600 
181 SSETAVKITIKADMNDTKFG 

200 

601 

aaagcaggtgagacaacttattcaactgggcttacatttgaagatgggtctacag;\aaaa 660 
201 kagettystgltfedgstek 

220 

661 

ATTGTTAAATTAGGGGACAGTGATATTATAGATATAACTAAAGCTCTTAAACTTACTGTT 720 
221 IVKLGDSDI IDITKALKLTV 

240 

721 

GTTCCTGGAAGTAAAGCAACTGTTAAGTTTGCTGAAAAAACACCAAGTGCCAGTGTTCAA 780 
241 VPGSKATVKFAEKTPSASVQ 

260 

781 

CCAGTAATAACAAAGCTTAGAATAATAAATGCTAAAGAAGAAACAATAGATATTGACGCT 84 0 
261 PVITKLRIINAKEETIDIDA 

280 

841 

AGTTCTAGTAAAACAGCACAAGATTTAGCTAAAAAATATGTATTTAATAAAACTGATTTA 900 
281SSSKTAQDLAKKYVFNKTDL 

300 

901 

AATACTCTTTATAAAGTATTAAATGGAGATGAAGCAGATACTAATGGATTAATAGAAGAA 960 
301 NTLYKVLNGDEADTNGLIEE 

320 

961 

GTTAGTGGAAAATATCTU^GTAGTTCTTTATCCAGAAGGAAAAAGAGTTACAACTAAGAGT 1020 
321 VSGKYQVVLYPEGKRVTTKS 

340 

1021 

GCTGCAAAGGCTTCAATTGCTGATGAAAATTCACCAGTTAAATTAACTCTTAAGTCAGAT 1080 
341AAKASIADENSPVKLTLKSD 

360 

♦ 

1081 

AAGAAGAAAGACTTAAAAGATTATGTGGATGATTTAAGAACATATAATAATGGATATTCA 114 0 
361 KKKDLKDYVDDLRTYNNGYS 

380 
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1141 

AATGCTATAGAAGTAGCAGGAGAAGATAGAATAGAAACTGCAATAGCATTAAGTCAAAAA 1200 



381 NAIEVAGEDRIETAIALSQK 

5 400 

1201 

TATTATAACTCTGATGATGAAAATGCTATATTTAGAGATTCAGTTGATAATGTAGTATTG 12 60 



401 YYNSDDENAI FRDSVDNVVL 

10 420 

1261 

GTTGGAGGAAATGCAATAGTTGATGGACTTGTAGCTTCTCCTTTAGCTTCTGAAAAGAAA 1320 



421VGGNAIVDGLVASPLASEKK 

15 440 

1321 

GCTCCTTTATTATTAACTTCA7VAAGATAAATTAGATTCAAGCGTAAAAGCTGAAATAAAG 1380 



441 APLLLTSKDKLDSSVKAEI K 

20 4 60 

1381 

AGAGTTATGAATATAAAGAGTACAACAGGTATAAATACTTCAAAGAAAGTTTATTTAGCT 14 40 



461RVMNIKSTTGINTSKKVYLA 

25 480 

1441 

GGTGGAGTTAATTCTATATCTAAAGAAGTAGAAAATGAATTAAAAGATATGGGACTTAAA 1500 



481 GGVNSISKEVENELKDMGLK 

30 500 

1501 

GTTACAAGATTAGCAGGAGATGATAGATATGAAACTTCTCTAAAAATAGCTGATGAAGTA 15 60 



501 VTRLAGDDRYETSLKIADEV 

35 520 

1561 

GGTCTTGATAATGATAAAGCATTTGTAGTTGGAGGAACAGGATTAGCAGATGCCATGAGT 1620 



521 GLDNDKAFVVGGTGLADAMS 

40 540 

1621 

ATAGCTCCAGT TGCATCTCAATTAAGAAATGCTAATGGT AAAATGGATTTAGCTGATGGT 1680 



45 



541 lAPVASQLRNANGKMDLADG 

560 
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1681 

GATGCTACACCAATAGTAGTTGTAGATGGAAAAGCTAAAACTATAAATGATGATGTAAAA 17 40 
561 DATPIVVVDGKAKTINDDVK 

580 

1741 

GATTTCTTAGATGATTCACAAGTTGATATAATAGGTGGAGAAAACAGTGTATCTAAAGAT 1800 
581 DFLDDSQVDI I GGENSVSKD 

600 

1801 

GTTGAAAATGCAATAGATGATGCTACAGGTAAATCTCCAGATAGAT ATAGTGGAGATGAT 18 60 
601 VENAIDDATGKSPDRYSGDD 

620 

1861 

AGACAAGCAACTAATGCAAAAGTTATAAAAGAATCTTCTTATTATCAAGATAACTTAAAT 1920 
621RQATNAKVIKESSYYQDNLN 

640 

1921 

AATGATAAAAAAGTAGTTAATTTCTTTGTAGCTAAAGATGGTTCTACTAAAGAAGATCAA 1980 
641 NDKKVVNFFVAKDGS TKEDQ 

660 

1981 

TTAGTTGATGCTTTAGCAGCAGCTCCAGTTGCAGCATIACTTTGGTGTAACTCTTAATTCT 2040 
661LVDALAAAPVAANFGVTLNS 

680 

2041 

GATGGTAAGCCAGTAGATAAAGATGGTAAAGtATTAACTGGTTCTGATAATGATAAAT^T 2100 
681DGKPVDKDGKVLTGSDNDKN 

700 

2101 

AAATTAGTATCTCCAGCACCTATAGTATTAGCTACTGATTCTTTATCTTCAGATCaAAGT 2160 
701 KLVSPAPIVLATDSLSSDQS 

720 

2161 

GTATCTATAAGTAaAGTTCTTGATAAAGATAATGGAGAAAACTTAGTTCAAGTTGGTAAA 2220 
721 VS ISKVLDKDNGENLVQVGK 

740 

2221 GGTATAGCTACTTCAGTTATAAACAAAATGAAAGATTTATTAGATATG 22 68 
741 GIATSVINKMKDLLDM 756 



50 
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Appendix 2 

SEQ ID No. 4. Nucleotide sequence of slpA from Clostridium 
difficile strain 172450, PGR type 5, with translation. The 
putative secretory signal cleavage site (A) is indicated, and an 
approximation of the and site of cleavage to form the two mature 
SLPs (♦) is also indicated. 

1 

ATGAAAAAAAGAAATTTAGCAATGGCTATGGCAGCTGTTACTGTAGTAGGTTCTGCTGCT 60 

IMKKRNLAMAMAAVTVVGSAA 

20 

61 

CCAGTTTTTGCAGCAGCTTCAGATGTAATATCACTACAAGATGGTACAAATGATAAGTAT 120 
21PVFAA ASDVISLQDGTNDKY 

40 

121 

ACAGTATCAAATACTAAAGCTAGTGACTTAGTAAAGGATATTTTAGCAGCACAAAACTTA 180 
41 TVSNTKASDLVKDILAAQNL 

60 

181 

ACAACAGGTGCAGTTATTTTGAACAAAGATACAAAAGTTACTTTCTATGATGCAAATGAG 24 0 
eiTTGAVILNKDTKVTFYDANE 

80 

241 

AAAGATTCTTCAACTCCAACTGGAGATAAAAAAGTTTATTCAGAACAAACTTTAACTACA 300 
81KDSSTPTGDKKVYSEQTLTT 

100 

301 

GCTAATGGAAATGAAGATTATGTAAAGACAACTTTAAAAAATTTAGATGCAGGAGAATAT 360 
101 ANGNEDYVKTTLKNLDAGEY 

120 

361 

GCTATTATAGATTTAACTTATAATAATGCTAAAACTGTTGAAATTAAAGTAGTAGCAGCT 420 

121 AI IDLTYNNAKTVEIKVVAA 

140 

421 

AGTGAAAAAACAGTAGTTGTATCTAGTGATGCGAAAAATAGTGCAAAAGATATAGCTGAA 4 80 
141 SEKTVVVSSDAKNSAKDIAE 



481 

AAATATGTGTTTGAAGACAAAGACTTAGAAAATGCACTAAAAACTATAAATGCCTCAGAT 54 0 
161KYVFEDKDLENALKTINASD 
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541 

TTCAGTAAAACTGATAGTTACTATCAAGTAGTTCTTTATCCAAAAGGAAAGAGATTACAA 600 
181 FSKTDSYYQVVLYPKGKRLQ 

200 

601 

GGTTTCTCAACTTATAGAGCTACAAATTATAATGAAGGAACTGCATATGGTAATACACCA 660 
201 GFSTYRATNYNEGTAYGNTP 

220 

♦ 

661 

GTAATATTAACTCT7W\AATCTACTAGTAAGAGTAATTTAAAGACTGCAGTAGAAGAGTTA 720 
221 VILTLKSTSKSNLKTAVEEL 

240 

721 

CAAAAATTGAATGCTAGTTATTCTAATACTACAACTTTAGCTGGTGATGACAGAATACAA 780 
241QKLNASYSNTTTLAGDDRIQ 

260 

781 

ACAGCTATAGAGATAAGTAAAGAATATTACAATAATGATGGCGAGAAATCAGATCATTCA 840 
261 TAIEISKEYYNNDGEKSDHS 

280 

841 

GCTGATGTTAAAGAGAATGTTAAAAATGTTGTATTAGTAGGTGCAAATGCACTAGTAGAT 900 
281 ADVKENVKNVVLVGANALVD 

300 

901 

GGATTAGTTGCGGCTCCTTTAGCAGCAGAAAAAGATGCTCCACTATTATTAACTTCAAAA 960 
301 GLVAAPLAAEKDAPLLLTSK 

320 

961 

GATATVATTAGATTCGTCAGTAAAATCTGAAATAAAGAGAGTTTTAGACTTAAAAACTTCA 1020 
321DKLDSSVKSEIKRVLDLKTS 

340 

1021 

ACAGAAGTAACAGGAAAAACAGTTTATATAGCTGGTGGAGTTAATAGTGTATCTAAAGAA 1080 
341 TEVTGKTVYIAGGVNSVSKE 

360 

1081 

GTTGTAACAGAATTAGAATCAATGGGATTAAAAGTTGAAAGATTCTCAGGTGATGATAGA 1140 
361 VVTELESMGLKVERFSGDDR 

380 

1141 

TATGAAACTTCTTTAAAAATAGCAGGTGAAATAGGCTTAGATAATGATAAGGCTTATGTA 1200 
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5 



10 



15 



20 



400 

1201 

GTTGGTGGAACAGGATTAGCAGATGCCATGAGTATAGCTTCAGTTGCTTCTACTA?^TTA 


1260 


420 

1261 

GATGGTAATGGTGTTGTAGATAGAACAAATGGACATGCTACTCCAATAGTTGTTGTAGAT 


1320 


421DGNGVVDRTNGHATPIVV 

440 


V D 


1321 

GGAAAAGCTGATAAAATATCTGATGACTTAGATAGTTTCTTAGGAAGCGCTGATGTAGAT 


1380 


460 




1440 




480 

1441 
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1681 

CCAGTTGTATTAGCTACTGATTCTTTATCTTCTGATCAATCGGTTGCTATAAGCAAAGTT 17 4 0 



561 PVVLATDSLSSDQSVAISKV 

580 

1741 

GTAGGAGAAAAATATTCTAAAGATTTAACACAAGTTGGTCAAGGAATAGCTAATTCAGTT 1800 



581 VGEKYSKDLTQVGQGIANSV 

600 

1801 ATAAACAAAATGAAAGATTTATTAGATATG 1830 



601 INKMKDLLDM 610 
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Appendix 3 

SEQ ID No. 5. Nucleotide sequence of slpA from Clostridium 
difficile strain 170324, PGR type 12, with translation. The 
putative secretory signal cleavage site (A) and site of cleavage to 
form the two mature SLPs (♦) are indicated. 



ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCT 
60 



IMNKKNIAIAMSGLTVLASAA 

20 

61 

CCTGTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAA 120 



21PVFAATTGTQGYTVVKNDWK 

40 

A 

121 

AAAGCAGTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTA 180 



41KAVKQLQDGLKDNSIGKITV 

60 

181 

TCTTTTAATGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGAC 24 0 



61SFNDGVVGEVAPKSANKKAD 

80 

241 

AGAGATGCTGCAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGT 300 



81RDAAAEKLYNLVNTQLDKLG 

100 

301 

GATGGAGATTATGTTGATTTTTCTGTAGATTATAATTTAGAAAACAAAATAATAACTAAT 360 



101 DGDYVDFSVDYNLENKI ITN 

120 

361 

CAAGCAGATGCAGAAGCAATTGTTACAAAGTT7VAATTCACTTAATGAGAAAACTCTTATT 4 20 



121 QADAEAIVTKLNSLNEKTLI 

140 

421 

GATATAGCAACTAAAGATACTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAA 4 8 0 



141 DIATKDTFGMVSKTQDSEGK 

160 
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481 

AATGTTGCTGCAACAAAGGCACTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCT 54 0 
161 NVAATKALKVKDVAT FOLKS 

180 

541 

GGTGGAAGCGAAGATACTGGATATGTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAG 600 

181 GGSEDTGYVVEMKAGAVEDK 

200 

601 

TATGGTAAAGTTGGAGATAGTACGGCAGGTATTGCAATAAATCTTCCTAGTACTGGACTT 660 
201 YGKVGDSTAGIAINLPSTGL 

220 

661 

GAATATGCAGGTAAAGGAACAACAATTGATTTTAATAAAACTTTAAAAGTTGATGTAACA 720 
221 EYAGKGTTIDFNKTLKVDVT 

240 

721 

GGTGGTTCAACACCTAGTGCTGTAGCTGTAAGTGGTTTTGTAACTAAAGATGATACTGAT 780 
241 GGSTPSAVAVSGFVTKDDTD 

260 

781 

TTAGCAAAATCAGGTACTATAAATGTAAGAGTTATAAATGCAAAAGAAGAATCAATTGAT 840 
261 LAKSGTINVRVINAKEESID 

280 

841 

ATAGATGCAAGCTCATATACATCAGCTGAAAATTTAGCTAAAAGATATGTATTTGATCCA 900 
281 IDASSYTSAENLAKRYVFDP 

300 

901 

GATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTACAAAR.TGATGGTATAGAGTCT 960 
301 DEISEAYKAIVALQNDGIES 

320 

961 

AACTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTTATCCAGAAGGTAAAAGA 1020 
321NLVQLVNGKYQVIFYPEGKR 

340 

1021 

TTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACACCAGCTAAAGTAGTT 108 0 
341 LETKSANDTIASQDTPAKVV 

360 



1081 

ATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTAAAAACATATAAT 114 0 
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361IKANKLKDLKDYVDDLKTYN 

380 

1141 

AATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAA 1200 
381 NTYSNVVTVAGEDRIETAIE 

400 

1201 

TTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGCAGTTAAT 1260 
401 LSSKYYNSDDKNAITDKAVN 

420 

1261 

GATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCT 1320 



421 DIVLVGSTSIVDGLVASPLA 



441 SEKTAPLLLTSKDKLDSSVK 

460 

1381 

TCTGAAATAAAGAGAGT TATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAA 14 4 0 
461 SEIKRVMNLKSDTGINTSKK 

480 

1441 

GTTTATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAT^C 1500 
481 VYLAGGVNSISKDVENELKN 

500 

1501 

ATGGGTCTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATA 1560 
501 MGLKVTRLSGEDRYETSLAI 

520 

1561 

GCTGATGAAATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCA 1620 
521 ADEIGLDNDKAFVVGGTGLA 

540 

1621 

GATGCTATGAGTATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATA 1680 
541 DAMSIAPVASQLK DGDATPI 

560 



1681 

GTAGTTGTAGATGGAAAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACT 17 40 
561 VVVDGKAKEISDDAKSFLGT 

580 

1741 

TCTGATGTTGATATAATAGGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATA 1800 
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581 SDVDIIGGKNSVSKEIEESI 

600 

1801 

GATAGTGCAACTGGAAAAACTCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAAT 18 60 
601 DSATGKTPDRISGDDRQATN 

620 

1861 

GCTGAAGTTTTAAAAGAAGATGATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTT 1920 
621 AEVLKEDDYFTDGEVVNYFV 

640 

1921 

GCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGATGCCTTAGCAGCAGCACCAATA 1980 
641 AKDGSTKEDQLVDALAAAPI 

660 

1981 

GCAGGTAGATTTAAGGAGTCTCCAGCTCCAATCATACTAGCTACTGATACTTTATCTTCT 2040 
661 AGRFKESPAPIILATDTLSS 

680 

2041 

GACCAAAATGTAGCTGTAAGTAAAGCAGTTCCTAAAGATGGTGGAACTAACTTAGTTCAA 2100 
681 DQNVAVSKAVPKDGGTNLVQ 

700 

2101 GTAGGTAAAGGTATAGCTTCTTCAGTTATAAACAAAATGAAAGATTTATTAGATATG 
2157 

701 VGKG lASSVINKMKDLLDM 

719 



35 
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Appendix 4 

SEQ ID No 6. Nucleotide sequence of slpA from Clostridium 
difficile strain 171448, PGR type 12, with translation. The 
putative secretory signal cleavage site (A) and site of cleavage to 
form the two mature SLPs (♦) are indicated. 



ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCT 60 

IMNKKNIAIAMSGLTVLASAA 

20 

61 

CCTGTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAA 120 
21PVFAATTGTQGYTVVKNDWK 



121 

AAAGCAGTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTA 180 
41KAVKQLQDGLKDNSI GKITV 

60 

181 

TCTTTTAATGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGAC 24 0 
61SFNDGVVGEVAPKSANKKAD 

80 

241 

AGAGATGCTGCAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGT 300 
81RDAAAEKLYNLVNTQLDKLG 

100 

301 

GATGGAGATTATGTTGATTTTTCTGTAGATTATAATTTAGAAAACAAAATAATAACTAAT 360 
101 DGDYVDFSVDYNLENKIITN 

120 

361 

CAAGCAGATGCAGAAGCAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATT 420 
121QADAEAIVTKLNSLNEKTLI 

140 

421 

GATATAGCAACTAT^GATACTTTTGGAATGGTTAGTAAAACACAAGATAGTGGAGGTAAA 480 
141 DIATKDTFGMVSKTQDSGGK 
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AATGTTGCTGCAACAAAGGCACTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCT 54 0 
161NVAATKALKVKDVATFGLKS 

180 

541 

GGTGGAAGCGAAGATACTGGATATGTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAG 600 
181 GGSE DTGYVVEMKAGAVEDK 

200 

601 

TATGGTAAAGTTGGAGATAGTACGGCAGGTATTGCAATAAATCTTCCTAGTACTGGACTT 660 
201 YGKVGDSTAGIAINLPSTGL 

220 

661 

GAATATGCAGGTAAAGGAACAACAATTGATTTTAATAAAACTTTAAAAGTTGATGTAACA 720 
221 EYAGKGTTIDFNKTLKVDVT 

240 

721 

GGTGGTTCAACACCTAGTGCTGTAGCTGTAAGTGGTTTTGTAACTAAAGATGATACTGAT 7 80 
241 GGST PSAVAVSGFVTKDDTD 

260 

781 

TTAGCAAAATCAGGTACTATAAATGTAAGAGTTATAAATGCAAAAGAAGAATCAATTGAT 84 0 
261 LAKSGTINVRVINAKEESID 

280 

841 

ATAGATGCAAGCTCATATACATCAGCTGAAAATTTAGCTAAAAGATATGTATTTGATCCA 900 
281 IDASSYTSAENLAKRYVFDP 

300 

901 

GATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTACAAAATGATGGTATAGAGTCT 960 
301 DEISEAYKAIVALQNDGIES 

320 

961 

AATTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTTATCCAGAAGGTAAAAGA 1020 
321 NLVQLVNGKYQVI FYPEGKR 

340 

1021 

TTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACACCAGCTAAAGTAGTT 1080 
341 LETKSANDTIASQDT PAKVV 

360 



1081 

ATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTAAAAACATATAAT 1140 
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361 I KANKLKDLKDYVDDLKTYN 

380 

1141 

AATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAA 1200 
381 NTYSNVVTVAGEDRIETAIE 

400 

1201 

TTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGCAGTTAAT 12 60 



401LSSKYYNSDDKNAITDKAVN 

420 

1261 

GATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCT 1320 
421 DIVLVGSTSIVDGLVASPLA 

440 

1321 

TCAGAAAAAACAGCTCCATTATTATTAGCTTCAAAAGATAAATTAGATTCATCAGTAAAA 1380 
441 SEKTAPLLLASKDKLDSSVK 

460 

1381 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAA 14 4 0 
461 SEIKRVMNLKSDTGINTSKK 

480 

1441 

GTTTATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAAC 1500 
481VYLAGGVNSISKDVENELKN 

500 

1501 

ATGGGTCTTTW^GTTACTAGATTATCAGGAGAAGACAGATACGT^AACTTCTTTAGCAATA 1560 
501 MGLKVTRLSGEDRYETSLAI 

520 

1561 

GCTGATGAAATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCA 1620 
521 ADE IGLDNDKAFVVGGTGLA 

540 

1621 

GATGCTATGAGTATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATA 1680 



541 DAMSIAPVASQLKDGDATPI 

560 



1681 

GTAGTTGTAGATGGAAAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACT 17 4 0 
561 VVVDGKAKEISDDAKSFLGT 

580 

1741 

TCTGATGTTGATATAATAGGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATA 1800 
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581 SDVDI IGGKNSVSKEIEESI 

600 

1801 

GATAGTGCAACTGGAAAAACTCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAAT 18 60 

5 + + + + + 

601 DSATGKTPDRISGDDRQATN 

620 

1861 

GCTGAAGTTTTAAAAGAAGATGATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTT 1920 

10 + + + + + 

621 AEVLKEDDYFTDGEVVNYFV 

640 

1921 

GCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGATGCCTTAGCAGCAGCACCAATA 1980 

15 + + + + + 

641AKDGSTKEDQLVDALAAAPI 

660 

1981 

GCAGGTAGATTTAAGGAGTCTCCAGCTCCAATCATACTAGCTACTGATACTTTATCTTCT 2040 

20 + + + + + 

661 AGRFKESPAPIILATDTLSS 

680 

2041 

GACCAAAATGTAGCTGTAAGTAAAGCAGTTCCTAAAGATGGTGGAACTAACTTAGTTCAA 2100 

25 + + + + + 

681 DQNVAVSKAVPKDGGTNLVQ 

700 

2101 GTAGGTAAAGGTATAGCT TCT TCAGTTATAAACAAAATGAAAGATTTAT TAGATATG 
2157 

30 + + + + + 

701 VGKGIASSVINKMKDLLDM 

719 



35 
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Appendix 5 

SEQ ID No. 7. Nucleotide sequence of slpA from Clostridium 
difficile strain 171862, PGR type 17, with translation. The 
putative secretory signal cleavage site (A) and site of cleavage to 
form the two mature SLPs (♦) are indicated. 

1 

ATGAATAAGAAAAACTTAGCAATGGCTATGGCAGCAGTTACTGTTGTGGGTTCTGCAGCG 6 0 

IMNKKNLAMAMAAVTVVGSAA 

20 

61 

CCAATATTTGCAGATAGTACTACGCCAGGTTATACTGTAGTGAAAAATGATTGGAAAAAA 120 
21PIFADSTTPGYTVVKNDWKK 

40 

A 

121 

GCAGTAAAACAATTACAAGATGGGTTGAAAAATAAAACTATATCAACAATAAAGGTGTCT 180 
41AVKQLQDGLKNKT ISTIKVS 

60 

181 

TTTAATGGAAACTCTGTTGGAGAAGTTACACCAGCCAGTTCTGGAGCAAAAAAAGCAGAT 240 
61 FNGNSVGEVTPASSGAKKAD 

80 

241 

AGAGATGCTGCAGCTGAAAAGTTATATAATTTAGTAAATACACAATTAGATAAACTAGGT 300 

+ + + + + 

81RDAAAEKLYNLVNTQLDKLG 

100 

301 

GATGGAGATTACGTTGACTTTGAAGTAACTTATTU^TTTAGCTACTCAAATAATTACAAAA 360 
lOlDGDYVDFEVTYNLATQIITK 

120 

361 

GCAGAAGCAGAGGCAGTTCTTACAAAATTACAACAATATAATGATAAAGTACTTATAAAT 4 20 
121 AEAEAVLTKLQQYNDKVLIN 

140 

421 

TCTGCAACAGATACAGTAAAAGGTATGGTATCTGATACACAA.GTTGATAGCAAAAATGTT 4 80 
141 SATDTVKGMVSDTQVDSKNV 

160 
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481 

GCAGCTAACCCACTTAAAGTTAGTGATATGTATACAATACCATCTGCTATTACTGGAAGT 54 0 
161 AANPLKVSDMYTIPSAITGS 

180 

541 

GATGATTCTGGGTATAGTATTGCTAAACCAACAGAAAAGACTACAaGTTTATTGTATGGT 600 
181 DDSGYSIAKPTEKTTSLLYG 

200 

601 

ACGGTTGGTGATGCAACTGCAGGTAAAGCAATAACAGTAGATACAGCTTCAAATGAAGCT 660 
201 TVGDATAGKAITVDTASNEA 

220 

661 

TTTGCTGGAAATGGAAAGGTTATTGACTACAATAAATCATTCAAAGCAACTGTACAAGGA 7 2 0 
221FAGNGKVIDYNKSFKATVQG 

240 

721 

GATGGAACAGTTAAGACAAGCGGGGTTGTACTTAAAGATGCAAGTGATATGGCTGCAACA 780 
241 DGTVKTSGVVLKDAS DMAAT 

260 

781 

GGTACTATAAAAGTTAGAGTTACAAGTGCAAAAGAAGAATCTATTGATGTGGATTCAAGT 840 
261 GTIKVRVTSAKEESI DVDSS 

280 

841 

TCATATATTAGTGCTGAAAATTTAGCTAAAAAATATGTATTTAATCCTAAAGAGGTTTCT 900 
281 SYI SAENLAK KYVFNPKEVS 

300 

901 

GAAGCTTATAATGCAATAGTTGCATTACAAAATGATGGAATAGAATCTGATTTAGTACAA 960 
301EAYNAIVALQNDGIESDLVQ 

320 

961 

TTAGTTAATGGAAAATATCAAGTTATTTTCTATCCAGAAGGAAAAAGATTAGAAACTAAA 1020 
321 LVNGKYQVI FYPEGKRLETK 

340 



wo 02/062379 



43 



PCT/IE02/00017 



1021 TCTGCAGATATAATAGCTGATGCAGATAGTCCAGCTAAAATAACTATAAAAGCTAATAAA 
1080 

341SADIIADADSPAKITIKANK 

360 

♦ 

1081 

TTAAAAGATTTAAT^GATTATGTAGATGATTTAAAAACATACAATAATACTTACTCAAAT 1140 
361 LKDLKDYVDDLKTYNNTYSN 

380 

1141 

GTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAATT AAGTAGTAAATAT 1200 



381 VVTVAGEDRIETAIELSSKY 

400 

1201 

TATAATTCTGATGATAAAAATGCAATAACTGATGATGCAGTTAAT AATATAGTATTAGTT 1260 
401 YNS DDKNAITDDAVNNIVLV 

420 

1261 

GGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCTTCAGAAAAAACAGCT 1320 
421 GST S IVDGLVAS P LASEKTA 

440 

1321 

CCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAATCTGAGATAAAAAGA 1380 
441 PLLLTSKDKLDSSVKSEIKR 

460 

1381 

GTTATGAACTTAAAGAGTGATACTGGTATAAATACTTCTAAAAAAGTTTATTTAGCTGGT 14 4 0 
461VMNLKSDTGINTSKKVYLAG 

480 

1441 

GGAGTTAATTCTATATCTAAAGATGTAGAAGATGAATTGAAAAATATGGGCCTTAAAGTT 1500 
481 GVNSI SKDVEDELKNMGLKV 

500 

1501 

ACT AGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAAATAGGT 15 60 
501TRLSGEDRYETSLAIADEIG 

520 

1561 

CTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTGGCAGATGCTATGAGTATA 1620 
521LDNDKAFVVGGTGLADAMS 1 

540 

1621 

GCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 1680 
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541 APVASQLKDGDATPIVVVDG 

560 

1681 

AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATA 1740 

5 + + + + _+ 

561 KAKEISDDAKSFLGTSDVDI 

580 

1741 

ATAGGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGA 1800 

10 + + + + + 

581 IGGKNSVSKEIEESIDSATG 

600 

1801 

AAAACTCCAGATAGAATAAGTGGAGATGACAGACAAGCAACTAATGCTGAAGTTTTAAAA 1860 

15 + + + + + 

601 KTPDRISGDDRQATNAEVLK 

620 

1861 

GAAGATGATTATTTCAAAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCT 1920 

20 + + + + + 

621 EDDYFKDGEVVNYFVAKDGS 

640 

1921 

ACTAAAGAAGATCAATTAGTAGATGCATTAGCAGCAGCACCAATAGCAGGTAGATTTAAG 1980 

25 + + + + + 

641 TKEDQLVDALAAAPIAGRFK 

660 

1981 

GAGTCTCCAGCTCCAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCT 2040 

30 + + + + + 

661 ESPAPIILATDTLSSDQNVA 

680 

2041 

GTAAGTAAAGCAGTTCCTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATA 2100 

35 + + + + + 

681 VSKAVPKDGGTNLVQVGKGI 

700 

2101 GCTTCTTCAGTTATAAACAAAATGAAAGATTTATTAGATATGTAA 214 5 



40 701 ASSVINKMKDLLDM* 715 
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Appendix 6 

SEQ ID No 8. Nucleotide sequence of slpA from Clostridium 
difficile strain 173644, PCR type 31, with translation. The 
putative secretory signal cleavage site (A) and site of cleavage to 
form the two mature SLPs (♦) are indicated. 



ATGAAT AAGAAGGATATAGCAATAGCTATGTCAGGATTAACAGTATTAGCTTCTGCAGCA 6 0 

IMNKKDIAIAMSGLTVLASAA 

20 

61 

CCTGTATTTGCTGCTAGTAGTTTTACAGCAGATTATAATTATACTGTAGTGCAAGGAAAA 120 
21 PVFAASS FTADYNYTVVQGK 

40 

A 

121 

TATCAAAAAGTTATAACTGGATTACAAGATGGTTTAAAAAATGGAAAAATAACAAATATT 180 
41 YQK VITGLQDGLKNGKI TNI 

60 

181 

GATGTAATATTTGATGGAAGTTCAATTGGTGAGGTAGTGCCAGGTTCTGATGCTGCAGCT 240 
61DVIFDGSSIGEVVPGSDAAA 

80 

241 

GCAGCTACTAAATTAAAAAGTTTAGTTGATGATAAGTTAGATAACTT AGGTGATGGAAAA 300 
81AATKLKSLVDDKLDNLGDGK 

100 

301 

TACGTTCAATTTAATGTTACTTATACTACTAAATCTATAATAACTAAAGCAGAATTAAAA 360 
101 YVQFNVTYTTKSI ITKAELK 

120 

361 

AATTATTATAATCAATTAGAAAGTAGTAAAGATAGAATACTTATAGGAAATGAACCTCAA 420 
121 NYYNQLESSKDRILIGNEPQ 

140 

421 

GATACAGGAACTAAAGGTCTTATAAAAGCTGATACTGATGGTACTACTGCTGTTGCAGCA 480 
141 DTGTKGLIKADTDGTTAVAA 

160 



481 

GCTGCACCATTGAAATTATCAGATATATTTACGTTTAGTTATGATGAAGTAACAGGTGTA 540 
161 AAPLKLSDI FTFSYDEVTGV 

180 
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541 

CTTAAAGCAGAACCAACAAGTAAAGTAAGCGCTGGTAAAGTTCAAGGTCTAAAATATGGA 600 
181LKAEPTSKVSAGKVQGLKYG 

200 

601 

AATACAGGAGCAACTAACTATACTTCTGGAGCTGAAATATCTGTTCCTACTACAGGCTTA 660 
201 NTGATNYTSGAEISVPTTGL 

220 

661 

ACATTAACTGCTGATACAACTGCAACAACAGATGTAAATATTTCTGATGTTATGAGTGCA 720 
221 TLTADTTATTDVNISDVMSA 

240 

721 

TTTAAATTTAATGGTACTGATACGATTAGTGGATTCCCAGCTGGTTCATCAGCTTCTACT 780 
241 FKFNGTDTISGFPAGSSAST 

260 

781 

CTTAGAGCAAGTATAAAAGTAATAAATGCAAAAGAAGAATCTATAGATGTTGATTCAAGT 84 0 
261 LRASIKVINAKEESIDVDSS 

280 

841 

TCACATAGAACAGCTGAAGATTTAGCTGAAAAATATGTATTTAAACCAGAAGATGTGAAT 900 
281 SHRTAEDLAEKYVFKPEDVN 

300 

901 

AAAACTTATGAGGCACTGACTGATTTATATAAAGAAGGTATAACAAGTAATCTTATCACT 960 
301 KTYEALTDLYKEGITSNLIT 

320 

961 

CAAGATGGTGGAAAATATCAAGTTGTTTTATTTGCTCAAGGAAAGAGATTAACTACTAAA 1020 



321 QDGGKYQVVLFAQGKRLTTK 

340 

1021 

GGAGCAACTGGAACTTTAGCAGATGAAAATTCTCCTCTTAAAGTAACAATAAAAGCAGAT 1080 



341 GATGTLADENSPLKVTIKAD 

360 



1081 

AAAGTAAAAGACTTAAAAGATTATGTTGAAGATTTAAAAAATGCTAACAATGGATATTCA 1140 
361 KVKDLKDYVEDLKNANNGYS 

380 

1141 

AATTCTGTTGTTGTAGCAGGTGAAGATAGAATAGAAACAGCAATAGAGTTAAGTAGCAAA 1200 
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381NSVVVAGEDRIETAIELSSK 

400 

1201 

TACTATAACTCTGATGATGACAATGCAATAACTAAAGATCCAGTTAACAATGTTGTTTTA 1260 
401 YYNSDDDNAITKDPVNNVVL 

420 

1261 

GTTGGTTCTCAAGCTGTAGTTGATGGGCTTGTAGCTTCACCTTTAGCATCTGAAAAAAGA 1320 
421 VGSQAVVDGLVASPLASEKR 

440 

1321 

GCTCCTTTACTATTAACTTCAGCAGGAAAATTAGATTCAAGTGTTAAAGCTGAGTTGAAA 1380 
441APLLLTSAGKLDSSVKAELK 

460 

1381 

AGAGTAATGGATTTAAAATCTACAACAGGTGTAAATACTTCTAAAAAAGTTTACTTAGCT 1440 
461RVMDLKSTTGVNTSKKVYLA 

480 

1441 

GGTGGAGTAAACTCTATATCTAAAGATGTAGAAAATGAATTAAAAGATATGGGACTTAAA 1500 
481 GGVNS ISKDVENELKDMGLK 

500 

1501 

GTTACAAGATTATCAGGAGATGATAGATATGTU^CTTCTTTAGCTATAGCTGATGAAATA 1560 
501 VTR. LSGDDRYETSLAIADEI 

520 

1561 

GGTCTTGATAATGATAAAGCTTTTGTAGTTGGAGGAACAGGATTAGCGGATGCTATGAGT 1620 
521 GLDNDKAFVVGGTGLADAMS 

540 

1621 

ATAGCTCCAGTTGCTTCTCAATTAAGAAACTCAAATGGAGAACTTGACTTAAAAGGTGAT 1680 
541IAPVASQLRNSNGELDLKGD 

560 



1681 

GCAACTCCAATAGTAGTTGTTGATGGAAAAGCTT^GATATAT^TTCTGAAGTAAAAGAT 1740 
561ATPIVVVDGKAKDINSEVKD 

580 

1741 

TTCTTAGATGATTCACAAGTTGATATAATAGGTGGTGTAAATAGTGTTTCTAAAGAAGTA 1800 
581FLDDSQVDIIGGVNSVSKEV 

600 

1801 

ATGGAAGCAATAGATG ATGCTACTGGAAAATCACCTGAGAGATATAGTGGAGAAGATAGA 1860 
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601 MEAIDDATGKSPERYSGEDR 

620 

1861 

CAAGCAACAAATGCTAAAGTTATAAAAGAAGATGATTTCTTTAAAAATGGAGAAGTTACA 1920 
621 QATNAKVIKEDDFFKNGEVT 

640 

1921 

AACTTCTTTGTAGCTAAAGATGGTTCAACTAAAGAAGATCAATTAGTAGATGCTTTAGCA 1980 
641 NFFVAKDGSTKEDQLVDALA 

660 

1981 

GGTGCTGCAATTGCTGGTAACTTTGGTGTAACAGTAGATAATGAAGGAAAACCTACAGTT 2040 
661 GAAIAGNFGVTVDNEGKPTV 

680 

2041 

GCTGATAAAAAAGCTTCTCCAGCACCAATTGTTTTAGCAACAGATTCTTTATCTTCTGAT 2100 
681 ADKKASPAPIVLATDSLSSD 

700 

2101 

CAAAATGTAGCTATAAGTAAAGCTGTT^AATGATGACGCTAATACTAAGAATCTAGTTCAA 2160 
701 QNVAI SKAVNDDANTKNLVQ 

720 

2161 GTTGGTAAAGGTATAGCTACTTCAGTTGTAAGTAAAATAAAAGATTTATTAGATATG 
2217 

721 VGKGIATSVVSKIKDLLDM 

739 



35 
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Appendix 7 

SEQ ID No 9. Nucleotide sequence of slpA from Clostridium difficile 
strain 170444, PGR type 46, with translation. The putative 
secretory signal cleavage site (A) and site of cleavage to form the 
two mature SLPs (♦) are indicated. 

1 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCT 60 

IMNKKNIAIAMSGLTVLASAA 

20 

61 

CCTGTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAft. 12 0 
21PVFAATTGTQGYTVVKNDWK 



121 

AAAGCAGTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTA 18 0 
41KAVK QLQDGLKDNS I GKI TV 

60 

181 

TCTTTTAATGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGAC 24 0 
61SFNDGVVGEVAPKSANKKAD 

80 

241 

AGAGATGCTGCAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGT 300 
81RDAAAEKLYNLVNTQLDKLG 

100 

301 

GATGGAGATTATGTTGATTTTTCTGTAGATTATAATTTAGAAAAAAAAATAATAACTAAT 360 
101 DGDYVDFSVDYNLEKKIITN 

120 

361 

CAAGCAGATGCRGAAGCAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATT 4 20 
121 QADAEAIVTKLNSLNEKTLl 

140 

421 

GATATAGCAACTAAAGATACTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAA 480 
141 DIATKDTFGMVSKTQDSEGK 

160 
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AATGTTGCTGCAACAAAGGCACTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCT 540 

5 161NVAATKALKVKDVATFGLKS 
180 

541 

GGTGGAAGCGAAGATACTGGATATGTTATTGAAATGAAAGCAGGAGCTGTAGAGGATAAG 600 

10 181GGSEDTGYVIEMKAGAVEDK 
200 

601 

TATGGTAAAGTTGGAGATAGTACGGCAGGTATTGCAATAAATCTTCCTAGT ACTGGACTT 560 

15 201 YGKVGDSTAGIAINLPSTGL 

220 

661 

GAATATGCAGGTAAAGGAACAACAATTGATTTTAATAAAACTTTAAAAGTTGATGTAACA 7 20 

20 221 EYAGKGTTIDFNKTLKVDVT 

240 

721 

GGTGGTTCAACACCTAGTGCTGT AGCTGTAAGTGGTTTTGTAACTAAAGATGAT ACTGAT 780 

25 241 GGSTPSAVAVSGFVTKDDTD 

260 

781 

TT AGCAAAATCAGGT ACT ATAAATGT AAGAGTTATAAATGCAAAAGAAGAATCAATTGAT 840 

30 261 LAKSGTINVRVINAKEESID 

280 

841 

ATAGATGCAAGCTCATATACATCAGCTGAAAATTTAGCTAAAAGACATGT ATTTGATCCA 90 0 

35 281 IDASSYTSAENLAKRHVFDP 

300 

901 

GATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTACAAAATGATGGTATAGAGTCT 96 0 

40 301 DEISEAYKAIVALQNDGIES 

320 

961 

AATTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTTATCCAGAAGGTAAAAGA 10 20 

45 321 NLVQLVNGKYQVIFYPEGKR 

340 
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1021 TTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACACCAGCTAAAGTAGTT 
1080 

341 LETKSANDTIAS QDTPAKVV 

360 

♦ 

1081 

ATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTAAAAACATATAAT 114 0 
361IKANKLKDLKDYVDDLKTYN 

380 

1141 

AATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAA 1200 
381 NTYSNVVTVAGEDRIETAIE 

400 

1201 

TTT^GTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGCAGTTAAT 1260 
401 LSSKYYNSDDKNAITDKAVN 

420 

1261 

GATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCT 1320 
421 DIVLVGSTSIVDGLVASPLA 

440 

1321 

TCAGAAAAAACAGCTCCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAA 1380 
441SEKTAPLLLTSKDK LDSSVK 

460 

1381 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAA 14 40 
461 SEIKRVMNLKSDTGINTSKK 

480 

1441 

GTTTATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAAC 1500 

481 VYLAGGVNSISKDVENELKN 

500 

1501 

ATGGGTCTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATA 1560 
501 MGLKVTRLSGEDRYETSLAI 

520 

1561 

GCTGATGAAATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCA 1620 
521 ADEIGLDNDKAFVVGGTGLA 

540 



1621 

GATGCTATGAGTATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATA 1680 
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541 DAMS lAPVASQLKDGDATPI 

560 

1681 

GTAGTTGTAGATGGAAAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACT 1740 

5 + + + + + 

561 VVVDGKAKEISDDAKSFLGT 

580 

1741 

TCTGATGTTGATATAATAGGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATA 1800 

10 + + + + + 

581 SDVDIIGGKNSVSKEIEESI 

600 

1801 

GATAGTGCAACTGGAAAAACTCCAGATAGAATAAGTGGAGATGATAGAO^AGCAACTAAT 1860 

15 + + + + + 

601DSATGKTPDRISGDDRQATN 

620 

1861 

GCTGAAGTTTTAAAAGAAGATGATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTT 1 920 

20 + + + + + 

621 AEVLKEDDYFT DGEVVNYFV 

640 

1921 

GCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGATGCCTTAGCAGCAGCACCAATA 1980 

25 + + + + + 

641 AKDGSTKEDQLVDALAAAPI 

660 

1981 

GCAGGTAGATTTAAGGAGTCTCCAGCTCCAATCATACTAGCTACTGATACTTTATCTTCT 2040 

30 + + + + + 

661 AGRFKESPAPIILATDTLSS 

680 

2041 

GACCAAAATGTAGCTGTAAGTAAAGCAGTTCCTAAAGATGGTGGAACTAACTTAGTTCAA 2100 

35 + + + + + 

681 DQNVAVSKAVPKDGGTNLVQ 

700 

2101 GTAGGTAAAGGTATAGCTTCTTCAGTTATAAACAAAATGAAAGATTTATTAGATATG 
2157 

40 + + + + + 

701VGKGIASSVINKMKDLLDM 

719 



45 



wo 02/062379 



53 



PCT/IE02/00017 



Appendix 8 

SEQ ID No 10. Nucleotide sequence of slpA from Clostridium difficile 
strain 170426, PGR type 92, with translation. The putative 
secretory signal cleavage site (A) and site of cleavage to form the 
two mature SLPs (♦) are indicated. 

1 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCT 6 0 

IMNKKNIAIAMSGLTVLASAA 



CCTGTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAA 120 
21 PVFAATTGTQGYTVVKNDWK 



121 

AAAGCAGTAAAACAATTACAGGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTA 180 
41 KAV. KQLQDGLKDNSIGKITV 

60 

181 

TCTTTTAATGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGAC 24 0 
61SFNDGVVGEVAPKSANKKAD 

80 

241 

AGAGATGCTGCAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGT 300 
81 RDAAAEKLYNLVNTQLDKLG 

100 

301 

GATGGAGATTATGTTGATTTTTCTGTAGATTATAATTTAGAAAAAAAAATAATAACTAAT 360 
101 DGDYVDFSVDYNLEKKI ITN 

120 

361 

CAAGCAGATGCAGAAGCAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATT 420 
121 QADAEAIVTKL NSLNEKTLI 

140 

421 

GATATAGCAACTAAAGATACTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAA 4 80 



141DIATKDTFGMV 

160 



481 

AATGTTGCTGCAACAAAGGCACTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCT 54 0 
161 NVAATKALKVKDVATFGLKS 

180 
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541 

GGTGGAAGCGAAGATACTGGATATGTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAG 600 
181GGSEDTGYVVEMKAGAVEDK 

200 

601 

TATGGTAAAGTTGGAGATAGTACGGCAGGTATTGCAATAAATCTTCCTAGTACTGGACTT 660 
201 YGKVGDSTAGIAINLPSTGL 

220 

661 

GAATATGCAGGTAAAGGAACAACAATTGATTTTAATAAAACTTTAAAAGTTGATGTAACA 7 20 
221 EYAGKGTTIDFNKTLKVDVT 

240 

721 

GGTGGTTCAACACCTAGTGCTGTAGCTGTAAGTGGTTTTGTAACTAAAGATGATACTGAT 780 
241 GGSTPSAVAVSGFVTKDDTD 

260 

781 

TTAGCAAAATCAGGTACTATAAATGTAAGAGTTATAAATGCAAAAGAAGAATCAATTGAT 84 0 
261 LAKSGTINVRVINAKEESID 

280 

841 

ATAGATGCAAGCTCATATACATCAGCTGAAAATTTAGCTAAAAGATATGTATTTGATCCA 900 
281 I DASSYTSAENLAKRYVFDP 

300 

901 

GATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTACAAAATGATGGTATAGAGTCT 960 
301 DEI SEAYKAIVALQNDGIES 

320 

961 

AATTTAGTTCAGTTAGTTAATGGAAAATATCT^AGTGATTTTTTATCCAGT^GGTAAAAGA 1020 
321 NLVQLVNGKYQVIFYPEGKR 

340 

1021 

TTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGAT ACACCAGCTAAAGTAGTT 1080 
341LETKSANDTIASQDTPAKVV 

360 



1081 

ATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTAAAAACATATAAT 1140 
361IKANKLKDLKDYVDDLKTYN 

380 

1141 

AATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAA 1200 
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381 NTYSNVVTVAGEDRIETAIE 

400 

1201 

TTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGCAGTTAAT 12 60 
401 LSSKYYNSDDKNAITDKAVN 

420 

1261 

GATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCT 1320 
421 DIVLVGSTSIVDGLVASPLA 

440 

1321 

TCAGAAAAAACAGCTCCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAA 1380 
441 SEKTAPLLLTSKDKLDSSVK 

460 

1381 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAA 14 40 
461 SEIKRVMNLKSDTGINTSKK 

480 

1441 

GTTTATTTAGCTGGTGGAGTTAATTCTATATCTABAGATGTAGAAAATGAATTGAAAAAC 1500 
481 VYLAGGVNSISKDVENELKN 

500 

1501 

ATGGGTCTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATA 1560 
501 MGLKVTRLSGEDRYETSLAI 

520 

1561 

GCTGATGAT^TAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCA 1620 
521 ADEIGLDNDKAFVVGGTGLA 

540 

1621 

GATGCTATGAGTATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATA 1680 
541 DAMSIAPVASQLKDGDATPI 

560 



1681 

GTAGTTGTAGATGGAAAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACT 1740 
561 VVVDGKAKEI SDDAKSFLGT 

580 

1741 

TCTGATGTTGATATAATAGGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATA 1800 
581 SDVDI IGGKNSVSKEIEESI 

600 

1801 

GATAGTGCAACTGGAAAAACTCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAAT 18 60 
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601 DSATGKTPDRISGDDRQATN 

620 

1851 

GCTGAAGTTTTAAAAGAAGATGATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTT 1920 
621 AEVLKEDDYFTDGEVVNYFV 

640 

1921 

GCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGATGCCTTAGCAGCAGCACCAATA 1980 
641AKDGSTKEDQLVDALAAAPI 

660 

1981 

GCAGGTAGATTTAAGGAGTCTCCAGCTCCAATCATACTAGCTACTGATACTTTATCTTCT 204 0 
661 AGRFKESPAPIILATDTLSS 

680 

2041 

GACCAAAATGTAGCTGTAAGTAAAGCAGTTCCTAAAGATGGTGGAACTAACTTAGTTCAA 2100 

681 DQNVAVSKAVPKDGGTNLVQ 

700 

2101 GTAGGTAAAGGTATAGCTTCTTCAGTTATAAACAAAATGAAAGATTTATTAGATATG 
2157 

701 VGKGIASSVINKMKDLLDM 

719 
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Claims 

1. A vaccine for the treatment or prophylaxis of C. difficile associated disease, 
the vaccine comprising a C. difficile gene or a C. difficile peptide/polypeptide 

5 or a derivative or fragment or mutant or variant thereof which is 

immunogenic in humans. 

2. A vaccine for the treatment or prophylaxis of C. difficile associated disease, 
the vaccine comprising a C. difficile gene or C. difficile peptide/polypeptide 

20 or a derivative or fragment or mutant or variant thereof to which 

immunoreactivity is detected in individuals who have recovered from C. 
difficile infection. 

3. A vaccine as claimed in claim 1 or 2 wherein the gene encodes a C. difficile 
15 surface layer protein, SlpA or variant or homologue thereof. 

4. A vaccine as claimed in claim 1 or 2 wherein the peptide/polypeptide is a C. 
difficile surface layer protein, SlpA or variant or homologue thereof. 

20 5. A vaccine as claimed in any of claims 1 to 4 wherein the vaccine comprises a 

chimeric nucleic acid sequence. 

6. A vaccine as claimed in 5 wherein the chimeric nucleic acid sequence is 
derived from the 5' end of the gene, encoding the mature N-terminal moiety 

25 of SlpA from C. difficile. 

7. A vaccine as claimed in any of claims 1 to 4 wherein the vaccine comprises a 
chimeric peptide/polypeptide. 

30 8. A vaccine as claimed in 7 wherein the amino acid sequence of the chimeric 

peptide/polypeptide is derived from the mature N-terminal moiety of SlpA 
from C. difficile. 
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A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains an 
amino acid sequence SEQ ID No.l or a derivative or fragment or mutant or 
variant thereof. 

A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains an 
amino acid sequence SEQ ID No.2 or a derivative or fragment or mutant or 
variant thereof. 

A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No.3 or a derivative or fragment or mutant or 
variant thereof. 

A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No.4 or a derivative or fragment or mutant or 
variant thereof. 

A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No. 5 or a derivative or fragment or mutant or 
variant thereof. 

A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No.6 or a derivative or fragment or mutant or 
variant thereof. 

A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No. 7 or a derivative or fragment or mutant or 
variant thereof. 



16. 

30 



A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No.8 or a derivative or fragment or mutant or 
variant thereof. 
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17. A vaccine as claimed in any of claims 1 to 8 wherein tlie vaccine contains a 
nucleotide sequence SEQ ID No.9 or a derivative or fragment or mutant or 
variant thereof. 

18. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No. 10 or a derivative or fragment or mutant or 
veiriant thereof. 

19. A vaccine as claimed in any preceding claim in combination with at least one 
other C. difficile sub-unit. 

20. A vaccine for the treatment or prophylaxis of C. difficile associated disease, 
the vaccine comprising the mature N-terminal moiety of a surface layer 
protein, SlpA of C. difficile or variant or homologue thereof which is 
immunogenic in humans. 

21. A vaccine as claimed in claim 20 wherein the N-terminal moiety of SlpA 
contains an amino acid sequence SEQ ID No. 1. 

22. A vaccine as claimed in claim 20 wherein the N-terminal moiety of SlpA 
contains an amino acid sequence SEQ ID No. 2. 

23. A vaccine for the treatment or prophylaxis of C. difficile associated disease, 
the vaccine comprising an immunodominant epitope derived from a C. 
difficile gene or a C. difficile peptide/polypeptide or a derivative or fragment 
or mutant or variant thereof which is immunogenic in humans. 

24. A vaccine as claimed in any preceding claim comprising a pharmaceutically 
acceptable carrier. 

25. A vaccine as claimed in any preceding claim in combination with a 
pharmacologically suitable adjuvant. 
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26. A vaccine as claimed in claim 25 wherein the adjuvant is interleukin 12. 

27. A vaccine as claimed in claim 25 or 26 wherein the adjuvant is a heat shock 
protein. 

28. A vaccine as claimed in any preceding claim comprising at least one other 
pharmaceutical product. 

29. A vaccine as claimed in claim 28 wherein the pharmaceutical product is an 
antibiotic. 

30. A vaccine as claimed in claim 29 wherein the antibiotic is selected from one 
or more metronidazole, amoxycillin, tetracycline or erythromycin, 
clarithromycin or tinidazole. 

31. A vaccine as claimed in claim 28 wherein the pharmaceutical product 
comprises an acid-suppressing agent such as omeprazole or bismuth salts. 

32. A vaccine as claimed in any preceding claim in a form for oral 
administration. 

33. A vaccine as claimed in any preceding claim in a form for intranasal 
administration. 

34. A vaccine as claimed in any preceding claim in a form for intravenous 
administration. 

35. A vaccine as claimed in any preceding claim in a form for intramuscular 
administration. 



36. 



A vaccine as claimed in any of claims 1 to 35 including a peptide delivery 
system. 
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37. An immunodominant epitope derived from a C. difficile gene or a C. difficile 
peptide/polypeptide or a derivative or fragment or mutant or variant thereof. 

38. An immunodominant epitope as claimed in claim 37 wherein the C. difficile 
5 peptide/polypeptide contains an amino acid sequence SEQ ID No.l or SEQ 

ID No.2 or a derivative or fragment or mutant or variant thereof. 

39. An immunodominant epitope as claimed in claim 35 wherein the C. difficile 
peptide/polypeptide contains an amino acid sequence SEQ ID No. 3 or SEQ 

10 ID No.4 or SEQ ID No.5 or SEQ ID No.6 or SEQ ID No.7 or SEQ ID No.8 

or SEQ ID No. 9 or SEQ ID No. 10 or a derivative or fragment or mutant or 
variant thereof. 

40. A chimeric nucleic acid sequence derived from the 5' end of the slpA gene 
X5 encoding the mature N-terminal moiety of SlpA from C. difficile which is 

immunogenic in humans. 

41. A chimeric peptide/polypeptide wherein the amino acid sequence of the 
chimeric peptide/polypeptide is derived from the mature N-terminal moiety 

20 of SlpA from C. difficile. 

42. A C. difficile peptide comprising SEQ ID No. 1. 

43. A C. difficile peptide comprising SEQ ID No. 2. 

25 

44. A C. difficile gene comprising SEQ ID No. 3. 

45. A C. difficile gene comprising SEQ ID No. 4. 
30 46. A C. difficile gene comprising SEQ ID No. 5. 



wo 02/062379 PCT/IE02/00017 



47. A C. difficile gene comprising SEQ ID No. 6. 

48. A C. difficile gene comprising SEQ ID No. 7. 

49. A C. difficile gene comprising SEQ ID No. 8. 

50. A C. difficile gene comprising SEQ ID No. 9. 

51. A C. difficile gene comprising SEQ ID No. 10. 

52. The use of a C. difficile gene or a C. difficile peptide/polypeptide or a 
derivative or fragment or mutant or variant thereof which is immunogenic in 
humans in the preparation of a medicament for use in a method for the 
treatment or prophylaxis of C. difficile infection or C. difficile associated 
disease in a host. 

53. The use as claimed in claim 52 wherein the medicament which is prepared is 
a vaccine as claimed in any of claims 1 to 36. 

54. A method for preparing a vaccine for prophylaxis or treatment of C. difficile 
associated disease, the method comprising; 

obtaining a C. difficile gene or a C. difficile peptide/polypeptide or a 
derivative or fragment or mutant or variant thereof which is 
immunogenic in humans; and 

forming a vaccine preparation comprised of said gene or 
peptide/polypeptide or derivative or fragment or mutant or variant, 
which is suitable for administration to a host and which when 
administered raises an immune response. 
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55. A method as claimed in claim 54 wherein the C. difficile peptide/polypeptide 
contains an amino acid sequence SEQ ID No.l or SEQ ID No.2 or a 
derivative or fragment or mutant or variant thereof. 

56. A method as claimed in claim 54 wherein the C. difficile gene contains an 
amino acid sequence SEQ ID No.3 or SEQ ID No.4 or SEQ ID No.5 or SEQ 
ID No.6 or SEQ ID No.7 or SEQ ID No.8 or SEQ ID No.9 or SEQ ID No.lO 
or a derivative or fragment or mutant or variant thereof. 

57. A method for prophylaxis or treatment of C. difficile associated disease, the 
method comprising; 

obtaining a C. difficile gene or a C. difficile peptide/polypeptide or a 
derivative or fragment or mutant or variant thereof which is 
immunogenic in humans; 

forming a vaccine preparation comprised of said gene or 
peptide/polypeptide or derivative or fragment or mutant or variant, 
and 

administering the vaccine preparation to a host to raise an immune 
response. 

58. Monoclonal or polyclonal antibodies or fragments thereof, to a C. difficile 
peptide/polypeptide or a derivative or fragment or mutant or variant thereof 
which is immunogenic in humans. 

59. Monoclonal or polyclonal antibodies or fragments thereof, to C. difficile 
peptide/polypeptide or a derivative or fragment or mutant or variant thereof 
to which immunoreactivity is detected in individuals who have recovered 
from C. difficile infection. 
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60. Purified antibodies or serum obtained by immunisation of an animal with a 
vaccine according to any of claims 1 to 36. 

61. The use of the antibodies or fragments as claimed in claims 58 and 59 in the 
preparation of a medicament for treatment or prophylaxis of C. difficile 
infection or C. difficile associated disease. 

62. The use of the antibodies or serum as claimed in 60 in the preparation of a 
medicament for treatment or prophylaxis of C. difficile infection or C. 
difficile associated disease. 

63. The use of the antibodies or fragments or serum as claimed in any of claims 
58 to 60 for use in passive immunotherapy for established C. difficile 
infection. 

64. The use of the antibodies or fragment or serum as claimed in any of claims 58 
to 60 for the eradication of C. difficile associated disease. 

65. Use of interleukin 12 as an adjuvant in C. difficile vaccine. 

66. The use of humanised antibodies or serum for passive vaccination of an 
individual with C. difficile infection. 
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SEQUENCE LISTING 

<110> The Provost, Fellows & Scholars of the College of the Holy and Undivided 
Trinity of Queen Elizabeth, near Dublin 

<120> C. difficile vaccine 

<130> TRI002/C/WO 

<160> 10 

<170> Patentin version 3.1 

<210> 1 

<211> 2157 

<212> DNA 

<213> Clostridium difficile 

<400> 1 

















cctgtttttg 




























tcttttaatg 




gggtgaagta 










agagatgctg 


cagctgagaa 


gttatataat 


cttgttaaca 


ctcaattaga 








atgttgattt 


ttctgtagat 


tataatttag 
































tgaaggtaaa 




aatgttgctg 


caacaaaggc 


acttaaagtt 


aaagatgttg 


ctacatttgg 


tttgaagtct 


540 


ggtggaagcg 


aagatactgg 


atatgttgtt 


gaaatgaaag 


caggagctgt 


agaggataag 


600 


tatggtaaag 


ttggagatag 


tacggcaggt 




atcttcctag 


tactggactt 


660 


gaatatgcag 


gtaaaggaac 


aacaattgat 


tttaataaaa 


ctttaaaagt 


tgatgtaaca 


720 


ggtggttcaa 


cacctagtgc 


tgtagctgta 


agtggttttg 


taactaaaga 


tgatactgat 


780 


ttagcaaaat 


caggtactat 


aaatgtaaga 


gttataaatg 


caaaagaaga 


atcaattgat 


840 


atagatgcaa 


gctcatatac 


atcagctgaa 


aatttagcta 


aaagatatgt 


atttgatcca 


900 


gatgaaattt 


ctgaagcata 


taaggcaata 


gtagcattac 


aaaatgatgg 


tatagagtct 


960 


aatttagttc 


agttagttaa 


tggaaaatat 


caagtgattt 


tttatccaga 


aggtaaaaga 


1020 


ttagaaacta 


aatcagcaaa 


tgatacaata 


gctagtcaag 


atacaccagc 


taaagtagtt 


1080 


ataaaagcta 


ataaattaaa 


agatttaaaa 


gattatgtag 


atgatttaaa 


aacatataat 


1140 


aatacttatt 


caaatgttgt 


aacagtagca 


ggagaagata 


gaatagaaac 


tgctatagaa 


1200 


ttaagtagta 


aatattataa 


ttctgatgat 


aaaaatgcaa 


taactgataa 


agcagttaat 


1260 
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1320 
1380 
1440 
1500 
1560 
1620 
aata 1680 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2157 



gatatagtat tagttggatc tacatctata gttgatggtc ttgttgcatc accattagct 
tcagaaaaaa cagctccatt attattaact tcaaaagata aattagattc atcagtaaaa 
tctgaaataa agagagttat gaacttaaag agtgacactg gtataaatac ttctaaaaaa 
gtttatttag ctggtggagt taattctata tctaaagatg tagaaaatga attgaaaaac 
atgggtctta aagttactag attatcagga gaagacagat acgaaacttc tttagcaata 
gctgatgaaa taggtcttga taatgataaa gcatttgtag ttggtggtac tggattagca 
gatgctatga gtatagctcc agttgcttct caacttaaag atggagatgc tactc 
gtagttgtag atggaaaagc aaaagaaata agtgatgatg ctaagagttt cttaggaact 
tctgatgttg atataatagg tggaaaaaat agcgtatcta aagagattga agagtcaata 
gatagtgcaa ctggaaaaac tccagataga ataagtggag atgatagaca 'agcaactaat 
gctgaagttt taaaagaaga tgattatttc acagatggtg aagttgtgaa ttactttgtt 
gcaaaagatg gttctactaa agaagatcaa ttagtagatg ccttagcagc agcaccaata 
gcaggtagat ttaaggagtc tccagctcca atcatactag ctactgatac tttatcttct 
gaccaaaatg tagctgtaag taaagcagtt cctaaagatg gtggaactaa cttagttcaa 
gtaggtaaag gtatagcttc ttcagttata aacaaaatga aagatttatt agatatg 

<210> 2 
<211> 1830 
<212> DNA 

<213> Clostridium difficle 

itgaiaaaaa gaaatttagc aatggctatg gcagctgtta ctgtagtagg ttctgctgct 60 
ccagtttttg cagcagcttc agatgtaata tcactacaag atggtacaaa tgataagtat 120 
acagtatcaa atactaaagc tagtgactta gtaaaggata ttttagcagc acaaaactta 
acaacaggtg cagttatttt gaacaaagat acaaaagtta ctttctatga tgcaaatgag 
aaagattctt caactccaac tggagataaa aaagtttatt cagaacaaac tttaactaca 
gctaatggaa atgaagatta tgtaaagaca actttaaaaa atttagatgc aggagaatat 
gctattatag atttaactta taataatgct aaaactgttg aaattaaagt agtagcagct 
agtgaaaaaa cagtagttgt atctagtgat gcgaaaaata gtgcaaaaga tatagctgaa 
aaatatgtgt ttgaagacaa agacttagaa aatgcactaa aaactataaa tgcctcagat 
ttcagtaaaa ctgatagtta ctatcaagta gttctttatc caaaaggaaa gagattacaa 
ggtttctcaa cttatagagc tacaaattat aatgaaggaa ctgcatatgg taatacacca 
gtaatattaa ctctaaaatc tactagtaag agtaatttaa agactgcagt agaagagtta 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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780 
840 
900 
960 

aaaacttca 1020 



caaaaattga atgctagtta ttctaatact acaactttag ctggtgatga cagaatacaa 
acagctatag agataagtaa agaatattac aataatgatg gcgagaaatc agatcattca 
gctgatgtta aagagaatgt taaaaatgtt gtattagtag gtgcaaatgc actagtagat 
ggattagttg cggctccttt agcagcagaa aaagatgctc cactattatt aacttcaaaa 
gataaattag attcgtcagt aaaatctgaa ataaagagag ttttagactt 
acagaagtaa caggaaaaac agtttatata gctggtggag ttaatagtgt atctaaagaa 
gttgtaacag aattagaatc aatgggatta aaagttgaaa gattctcagg tgatgataga 
tatgaaactt ctttaaaaat agcaggtgaa ataggcttag ataatgataa ggcttatgta 
gttggtggaa caggattagc agatgccatg agtatagctt cagttgcttc tactaaatta 12 60 
gatggtaatg gtgttgtaga tagaacaaat ggacatgcta ctccaatagt tgttgtagat 1320 
ggaaaagctg ataaaatatc tgatgactta gatagtttct taggaagcgc tgatgtagat 1380 
ataataggtg gatttgcaag tgtatctgaa aagatggaag aagc 



1080 

1140 
1200 



atatc agatgctact 1440 

ggtaaaggcg ttacaagagt taaaggcgac gatagacaag acactaactc tgaagttata 1500 

aaaacatatt atgctaatga tactgaaata gctaaagctg cagttttaga taaagattca 1560 

ggtgcttr.aa gtagtgatgc aggagtattt aatttctatg tagctaaaga tggatctaca 1620 

aaagaagatc aattagttga tgcattagca gtaggagctg ttgctggata taaacttgct 1680 

ccagttgtat tagctactga ttctttatct tctgatcaat cggttgctat aagcaaagtt 1740 
gtaggagaaa aatattctaa agatttaaca caagttggtc aaggaatagc taattcagtt 
ataaacaaaa tgaaagattt attagatatg 

<210> 3 
<211> 2158 
<212> DNA 

<213> Clostridium difficile 

<400> 3 

atgaataaga aaaatatagc aatagctatg tcaggtttaa cagttttagc ttcggctgct 60 
cctgtttttg ctgcaactac tggaacacaa ggttatactg tagttaaaaa cgactggaaa 120 

180 
240 
300 
360 
420 
480 



1800 
1830 



aaagcagtaa aacaattaca agatggacta aaagataata gtataggaaa gataactgta 
tcttttaatg atggggttgt gggtgaagta gctcctaaaa gtgctaataa gaaagcggac 
agagatgctg cagctgagaa gttatataat cttgttaaca ctcaattaga taaattaggt 
gatggagatt atgttgattt ttctgtagat tataatttag aaaacaaaat aataactaat 
caagcagatg cagaagcaat tgttacaaag ttaaattcac ttaatgagaa aactcttatt 
gatatagcaa ctaaagatac ttttggaatg gttagtaaaa cacaagatag tgaaggtaaa 
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aatgttgctg caacaaaggc acttaaagtt aaagatgttg ctacatttgg tttgaagtct 540 

ggtggaagcg aagatactgg atatgttgtt gaaatgaaag caggagctgt agaggataag 600 

tatggtaaag ttggagatag tacggcaggt attgcaataa atcttcctag tactggactt 660 

gaatatgcag gtaaaggaac aacaattgat tttaataaaa ctttaaaagt tgatgtaaca 720 

ggtggttcaa cacctagtgc tgtagctgta agtggttttg taactaaaga tgatactgat 780 

ttagcaaaat caggtactat aaatgtaaga gttataaatg caaaagaaga atcaattgat 840 

atagatgcaa gctcatatac atcagctgaa aatttagcta aaagatatgt atttgatcca 900 

gatgaaattt ctgaagcata taaggcaata gtagcattac aaaatgatgg tatagagtct 960 

aacttagttc agttagttaa tggaaaatat caagtgattt tttatccaga aggtaaaaga 1020 

ttagaaacta aatcagcaaa tgatacaata gctagtcaag atacaccagc taaagtagtt 1080 

ataaaagcta ataaattaaa agatttaaaa gattatgtag atgatttaaa aacatataat 1140 

aatacttatt caaatgttgt aacagtagca ggagaagata gaatagaaac tgctatagaa 1200 

ttaagtagta aatattataa ttctgatgat aaaaatgcaa taactgataa agcagttaat 1260 

gatatagtat tagttggatc tacatctata gttgatggtc ttgttgcatc accattagct 1320 

tcagaaaaaa cagctccatt attattaact tcaaaagata aattagattc atcagtaaaa 1380 

tctgaaataa agagagttat gaacttaaag agtgacactg gtataaatac ttctaaaaaa 1440 

gtttatttag ctggtggagt taattctata tctaaagatg tagaaaatga attgaaaaac 1500 

atgggtctta aagttactag attatcagga gaagacagat acgaaacttc tttagcaata 1560 

gctgatgaaa taggtcttga taatgataaa gcatttgtag ttggtggtac tggattagca 1620 

gatgctatga gtatagctcc agttgcttct caacttaaag atggagatgc tactccaata 1680 

gtagttgtag atggaaaagc aaaagaaata agtgatgatg ctaagagttt cttaggaact 1740 

tctgatgttg atataatagg tggaaaaaat agcgtatcta aagagattga agagtcaata 1800 

gatagtgcaa ctggaaaaac tccagataga ataagtggag atgatagaca agcaactaat 1860 

gctgaagttt taaaagaaga tgattatttc acagatggtg aagttgtgaa ttactttgtt 1920 

gcaaaagatg gttctactaa agaagatcaa ttagtagatg ccttagcagc agcaccaata 1980 

gcaggtagat ttaaggagtc tccagctcca atcatactag ctactgatac tttatcttct 2040 

gaccaaaatg tagctgtaag taaagcagtt cctaaagatg gtggaactaa cttagttcaa 2100 

gtaggtaaag gtatagcttc ttcagttata aacaaaatga aagatttatt agatatgg 2158 

<210> 4 
<211> 2271 
<212> DNA 

<213> Clostridium difficile 
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<400> 4 

atgaataaga aaaatatagc 
cctgtatttg cagatgatac 
agcaagtata agaaagctgt 
gaaattaaag ttttctttga 
aatgcagcag atgcaagtaa 
ggtgatggag attatgtaga 
acaagtaaac ttgttgcatt 
ggaacatctt cagcagatgg 
acaaattcag caggaacaaa 
tcatctgaaa ctgcggttaa 
aaagcaggtg agacaactta 
attgttaaat taggggacag 
gttcctggaa gtaaagcaac 
ccagtaataa caaagcttag 
agttctagta aaacagcaca 
aatactcttt ataaagtatt 
gttagtggaa aatatcaagt 
gctgcaaagg cttcaattgc 
aagaagaaag acttaaaaga 
aatgctatag aagtagcagg 
tattataact ctgatgatga 
gttggaggaa atgcaatagt 
gctcctttat tattaacttc 
agagttatga atataaagag 
ggtggagtta attctatatc 
gttacaagat tagcaggaga 
ggtcttgata atgataaagc 
atagctccag ttgcatctca 
gatgctacac caatagtagt 
gatttcttag atgattcaca 



aatagctatg 
aaaagttgaa 
tgaacaatta 
gggaacttta 
attattgttt 
tttcttaata 
gaaagattta 
tgttgttaca 
acttgcaatg 
gattactata 
ttcaactggg 
tgatattata 
tgttaagttt 
aataataaat 
agatttagct 
aaatggagat 
agttctttat 
tgatgaaaat 
ttatgtggat 
agaagataga 
aaatgctata 
tgatggactt 
aaaagataaa 
tacaacaggt 
taaagaagta 
tgatagatat 
atttgtagtt 
attaagaaat 
tgtagatgga 
agttgatata 



tcaggtttaa 
actggtgatc 
caaaaaggaa 
gcatctacta 
acacaagtag 
acttctccag 
acaggtgctt 
aatactggag 
tcagctattt 
aaagcagata 
cttacatttg 
gatataacta 
gctgaaaaaa 
gctaaagaag 
aaaaaatatg 
gaagcagata 
ccagaaggaa 
tcaccagtta 
gatttaagaa 
atagaaactg 
tttagagatt 
gtagcttctc 
ttagattcaa 
ataaatactt 
gaaaatgaat 
gaaacttctc 
ggaggaacag 
gctaatggta 
aaagctaaaa 
ataggtggag 



cagttttagc 
aaggatatac 
tattagatgg 
taaaagtagg 
ataataaact 
gtcaagggga 
cagcagatgc 
ctgctagtgg 
ttgacacagc 
tgaatgatac 
aagatgggtc 
aagctcttaa 
caccaagtgc 
aaacaataga 
tatttaataa 
ctaatggatt 
aaagagttac 
aattaactct 
catataataa 
caatagcatt 
cagttgataa 
ctttagcttc 
gcgtaaaagc 
caaagaaagt 
taaaagatat 
taaaaatagc 
gattagcaga 
aaatggattt 
ctataaatga 
aaaacagtgt 



ttcggctgca 
agtggtacaa 
aagtataaca 
ttctgagctt 
agataattta 
taaaataact 
tataattgct 
ttctactgag 
atatacagat 
taaatttggt 
tacagaaaaa 
acttactgtt 
cagtgttcaa 
tattgacgct 
aactgattta 
aatagaagaa 
aactaagagt 
taagtcagat 
tggatattca 
aagtcaaaaa 
tgtagtattg 
tgaaaagaaa 
tgaaataaag 
ttatttagct 
gggacttaaa 
tgatgaagta 
tgccatgagt 
agctgatggt 
tgatgtaaaa 
atctaaagat 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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gttgaaaatg caatagatga tgctacaggt 
agacaagcaa ctaatgcaaa agttataaaa 
aatgataaaa aagtagttaa tttctttgta 
ttagttgatg ctttagcagc agctccagtt 
gatggtaagc cagtagataa agatggtaaa 
aaattagtat ctccagcacc tatagtatta 
gtatctataa gtaaagttct tgataaagat 
ggtatagcta cttcagttat aaacaaaatg 

<210> 5 
<211> 2158 
<212> DNA 

<213> Clostridium difficile 
<400> 5 

atgaataaga aaaatatagc aatagctatg 
cctgtttttg ctgcaactac tggaacacaa 
aaagcagtaa aacaattaca agatggacta 
tcttttaatg atggggttgt gggtgaagta 
agagatgctg cagctgagaa gttatataat 
gatggagatt atgttgattt ttctgtagat 
caagcagatg cagaagcaat tgttacaaag 
gatatagcaa ctaaagatac ttttggaatg 
aatgttgctg caacaaaggc acttaaagtt 
ggtggaagcg aagatactgg atatgttgtt 
tatggtaaag ttggagatag tacggcaggt 
gaatatgcag gtaaaggaac aacaattgat 
ggtggttcaa cacctagtgc tgtagctgta 
ttagcaaaat caggtactat aaatgtaaga 
atagatgcaa gctcatatac atcagctgaa 
gatgaaattt ctgaagcata taaggcaata 
aatttagttc agttagttaa tggaaaatat 
ttagaaacta aatcagcaaa tgatacaata 
ataaaagcta ataaattaaa agatttaaaa 
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aaatctccag atagatatag tggagatgat 18 60 

gaatcttctt attatcaaga taacttaaat 1920 

gctaaagatg gttctactaa agaagatcaa 1980 

gcagcaaact ttggtgtaac tcttaattct 2040 

gtattaactg gttctgataa tgataaaaat 2100 

gctactgatt ctttatcttc agatcaaagt 2160 

aatggagaaa acttagttca agttggtaaa 2220 

aaagatttat tagatatgta a 2271 



tcaggtttaa cagttttagc ttcggctgct 60 

ggttatactg tagttaaaaa cgactggaaa 120 

aaagataata gtataggaaa gataactgta 180 

gctcctaaaa gtgctaataa gaaagcggac 240 

cttgttaaca ctcaattaga taaattaggt 300 

tataatttag aaaacaaaat aataactaat 360 

ttaaattcac ttaatgagaa aactcttatt 420 

gttagtaaaa cacaagatag tggaggtaaa 480 

aaagatgttg ctacatttgg tttgaagtct 540 

gaaatgaaag caggagctgt agaggataag 600 

attgcaataa atcttcctag tactggactt 660 

tttaataaaa ctttaaaagt tgatgtaaca 720 

agtggttttg taactaaaga tgatactgat 780 

gttataaatg caaaagaaga atcaattgat 840 

aatttagcta aaagatatgt atttgatcca 900 

gtagcattac aaaatgatgg tatagagtct 960 

caagtgattt tttatccaga aggtaaaaga 1020 

gctagtcaag atacaccagc taaagtagtt 1080 

gattatgtag atgatttaaa aacatataat 1140 
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aatacttatt 


caaatgttgt 


aacagtagca 


ggagaagata 


gaatagaaac 


tgctatagaa 


1200 


ttaagtagta 


aatattataa 


ttctgatgat 


aaaaatgcaa 


taactgataa 


agcagttaat 


1260 


gatatagtat 


tagttggatc 


tacatctata 


gttgatggtc 


ttgttgcatc 


accattagct 


1320 


tcagaaaaaa 


cagctccatt 


attattagct 


tcaaaagata 


aattagattc 


atcagtaaaa 


1380 


tctgaaataa 


agagagttat 


gaacttaaag 


agtgacactg 


gtataaatac 


ttctaaaaaa 


1440 


gtttatttag 


ctggtggagt 


taattctata 


tctaaagatg 


tagaaaatga 


attgaaaaac 


1500 


atgggtctta 


aagttactag 


attatcagga 


gaagacagat 


acgaaacttc 


tttagcaata 


1560 


gctgatgaaa 


taggtcttga 


taatgataaa 


gcatttgtag 


ttggtggtac 


tggattagca 


1620 


gatgctatga 


gtatagctcc 


agttgcttct 


caacttaaag 


atggagatgc 


tactccaata 


1680 


gtagttgtag 


atggaaaagc 


aaaagaaata 


agtgatgatg 


ctaagagttt 


cttaggaact 


1740 


tctgatgttg 


atataatagg 


tggaaaaaat 


agcgtatcta 


aagagattga 


agagtcaata 


1800 


gatagtgcaa 


ctggaaaaac 


tccagataga 


ataagtggag 


atgatagaca 


agcaactaat 


1860 


gctgaagttt 


taaaagaaga 


tgattatttc 


acagatggtg 


aagttgtgaa 






gcaaaagatg 


gttctactaa 


agaagatcaa 


ttagtagatg 


ccttagcagc 


agcaccaata 


1980 


gcaggtagat 


ttaaggagtc 


tccagctcca 


atcatactag 


ctactgatac 


tttatcttct 


2040 


gaccaaaatg 


tagctgtaag 


taaagcagtt 


cctaaagatg 


gtggaactaa 


cttagttcaa 


2100 


gtaggtaaag 


gtatagcttc 


ttcagttata 


aacaaaatga 


aagatttatt 


agatatgg 




<210> 6 
<211> 2217 
<212> DNA 

<213> Clostridium difficile 










<400> 6 
atgaataaga 


aggatatagc 


aatagctatg 


tcaggattaa 


cagtattagc 


ttctgcagca 




cctgtatttg 


ctgctagtag 


ttttacagca 


gattataatt 


atactgtagt 


gcaaggaaaa 




tatcaaaaag 


ttataactgg 


attacaagat 


ggtttaaaaa 


atggaaaaat 


aacaaatatt 




gatgtaatat 


ttgatggaag 


ttcaattggt 


gaggtagtgc 


caggttctga 


tgctgcagct 




gcagctacta 


aattaaaaag 


tttagttgat 


gataagttag 


ataacttagg 


tgatggaaaa 




tacgttcaat 


ttaatgttac 


ttatactact 


aaatctataa 


taactaaagc 


agaattaaaa 




aattattata 


atcaattaga 


aagtagtaaa 


gatagaatac 


ttataggaaa 


tgaacctcaa 


420 


gatacaggaa 


ctaaaggtct 


tataaaagct 


gatactgatg 


gtactactgc 


tgttgcagca 


480 


gctgcaccat 


tgaaattatc 


agatatattt 


acgtttagtt 


atgatgaagt 


aacaggtgta 


540 


cttaaagcag 


aaccaacaag 


taaagtaagc 


gctggtaaag 


ttcaaggtct 


aaaatatgga 


600 
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aatacaggag 


caactaacta 






ctgttcctac 


tacaggctta 


660 


acattaactg 


ctgatacaac 






tttctgatgt 


tatgagtgca 


720 


tttaaattta 


atggtactga 






ctggttcatc 


agcttctact 


780 


cttagagcaa 


gtataaaagt 






ctatagatgt 


tgattcaagt 


840 


tcacatagaa 


cagctgaaga 


tttagctgaa 




ttaaaccaga 


agatgtgaat 


900 


aaaacttatg 


aggcactgac 


tgatttatat 






tcttatcact 


960 


caagatggtg 


gaaaatatca 


agttgtttta 






aactactaaa 


1020 


ggagcaactg gaactttagc 


agatgaaaat 


tctcctctta 




aaaagcagat 


1080 


aaagtiaaaag 




ttatgttgaa 


gatttaaaaa 


atgctaacaa 


tggatattca 


1140 




ttgtagcagg 


tgaagataga 




caatagagtt 


aagtagcaaa 


1200 


tactataact 


ctgatgatga 


caatgcaata 






tgttgtttta 


1260 


gttggttctc 




tgatgggctt 




ctttagcatc 


iigaaaaaaga 


1320 




tattaacttc 


agcaggaaaa 


ttagattcaa 


gtgttaaagc 


tgagttgaaa 


1380 


agagtaatgg 


atttaaaatc 


tacaacaggt 






ttacttagct 


1440 


ggtggagtaa 




taaagatgta 






gggacttaaa 


1500 


gttacaagat 




tgatagatat 






tgatgaaata 


1560 


ggtcttgata 


atgataaagc 


ttttgtagtt 






tgctatgagt 


1620 


atagctccag 


ttgcttctca 




tcaaatggag 


aacttgactt 


aaaaggtgat 


1680 


gcaactccaa 


tagtagttgt 






taaattctga 


agtaaaagat 


1740 


ttcttagatg 


attcacaagt 


tgatataata 




atagtgtttc 


taaagaagta 


1800 


atggaagcaa 


tagatgatgc 


tactggaaaa 




gatatagtgg 


agaagataga 


1860 


caagcaacaa 


atgctaaagt 


tataaaagaa 




ttaaaaatgg 


agaagttaca 


1920 


aacttctttg 


tagctaaaga 


tggttcaact 




aattagtaga 


tgctttagca 


1980 


ggtgctgcaa 


ttgctggtaa 


ctttggtgta 






acctacagtt 


2040 


gctgataaaa 


aagcttctcc 


agcaccaatt 


gttttagcaa 


cagattcttt 


atcttctgat 


2100 


caaaatgtag 


ctataagtaa 


agctgtaaat 


gatgacgcta 


atactaagaa 


tctagttcaa 


2160 


gttggtaaag 


gtatagctac 


ttcagttgta 


agtaaaataa 


aagatttatt 


agatatg 


2217 



<210> 7 

<211> 2145 

<212> DNA 

<213> Clostridium difficile 



wo 02/062379 PCT/IE02/00017 
9/11 



<400> 7 



atgaataaga 


aaaacttagc 








ttctgcagcg 


60 


ccaatatttg 


cagatagtac 








ttggaaaaaa 


120 


gcagtaaaac 


aattacaaga 


tgggttgaaa 






aaaggtgtct 


180 


tttaatggaa 


actctgttgg 


agaagttaca 








240 


agagatgctg 


cagctgaaaa 


gttatataat 


ttagtaaata 


cacaattaga 


taaactaggt 


300 


gatggagatt 


acgttgactt 




tataatttag 


ctactcaaat 


aattacaaaa 


360 


gcagaagcag 


aggcagttct 






atgataaagt 


acttataaat 


420 


tctgcaacag 


atacagtaaa 






aagttgatag 




480 


gcagctaacc 


cacttaaagt 


tagtgatatg 




catctgctat 


tactggaagt 


540 


gatgattctg 


ggtatagtat 


tgctaaacca 






attgtatggt 


600 


acggttggtg 


atgcaactgc 


aggtaaagca 






aaatgaagct 


660 


tttgctggaa 


atggaaaggt 


tattgactac 






tgtacaagga 


720 


gatggaacag 


ttaagacaag 


cggggttgta 




caagtgatat 


ggctgcaaca 


780 


ggtactataa 


aagttagagt 








ggattcaagt 


840 


tcatatatta 


gtgctgaaaa 


tttagctaaa 






agaggtttct 


900 


gaagcttata 


atgcaatagt 


tgcattacaa 






tttagtacaa 


960 


ttagttaatg 


gaaaatatca 








agaaactaaa 


1020 


tctgcagata 


taatagctga 


tgcagatagt 




"taactataaa 


agctaataaa 


1080 


ttaaaagatt 


taaaagatta 






acaataatac 


ttactcaaat 


1140 


gttgtaacag 


tagcaggaga 


agatagaata 




tagaattaag 


tagtaaatat 


1200 


tataattctg 


atgataaaaa 






ttaataatat 


agtattagtt 


1260 


ggatctacat 


ctatagttga 


tggtcttgtt 




tagcttcaga 


aaaaacagct 


1320 


ccattattat 


taacttcaaa 


agataaatta 




taaaatctga 




1380 


gttatgaact 


taaagagtga 


tactggtata 


aatacttcta 




tttagctggt 


1440 


ggagttaatt 


ctatatctaa 


agatgtagaa 


gatgaattga 






1500 


actagattat 


caggagaaga 


cagatacgaa 


acttctttag 


caatagctga 


tgaaataggt 


1560 


cttgataatg 


ataaagcatt 


tgtagttggt 






tatgagtata 


1620 


gctccagttg 


cttctcaact 


taaagatgga 


gatgctactc 


caatagtagt 


tgtagatgga 


1680 


aaagcaaaag 


aaataagtga 


tgatgctaag 


agtttcttag 


gaacttctga 


tgttgatata 


1740 


ataggtggaa 


aaaatagcgt 


atctaaagag 


attgaagagt 


caatagatag 


tgcaactgga 


1800 
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1860 
1920 
1980 
2040 



aaaactccag atagaataag tggagatgac agacaagcaa ctaatgctga agttttaaaa 
gaagatgatt atttcaaaga tggtgaagtt gtgaattact ttgttgcaaa agatggttct 
actaaagaag atcaattagt agatgcatta gcagcagcac caatagcagg tagatttaag 
gagtctccag ctccaatcat actagctact gatactttat cttctgacca aaatgtagct 

gtaagtaaag cagttcctaa agatggtgga actaacttag ttcaagtagg taaaggtata 2100 
gcttcttcag ttataaacaa aatgaaagat ttattagata tgtaa 2145 

<210> 8 
<211> 2158 
<212> DNA 

<213> Clostridium difficile 
<400> 8 

atgaataaga aaaatatagc aatagctatg tcaggtttaa cagttttagc ttcggctgct 60 
cctgtttttg ctgcaactac tggaacacaa ggttatactg tagttaaaaa cgactggaaa 
aaagcagtaa aacaattaca agatggacta aaagataata gtataggaaa gataactgta 
tcttttaatg atggggttgt gggtgaagta gctcctaaaa gtgctaataa gaaagcggac 240 
agagatgctg cagctgagaa gttatataat cttgttaaca ctcaattaga taaattaggt 
gatggagatt atgttgattt ttctgtagat tataatttag aaaaaaaaat aataactaat 
caagcagatg cagaagcaat tgttacaaag ttaaattcac ttaatgagaa aactcttatt 
gatatagcaa ctaaagatac ttttggaatg gttagtaaaa cacaagatag tgaaggtaaa 
aatgttgctg caacaaaggc acttaaagtt aaagatgttg ctacatttgg tttgaagtct 
ggtggaagcg aagatactgg atatgttatt gaaatgaaag caggagctgt agaggataag 
tatggtaaag ttggagatag tacggcaggt attgcaataa atcttcctag tactggactt 
gaatatgcag gtaaaggaac aacaattgat tttaataaaa ctttaaaagt tgatgtaaca 7 20 
ggtggttcaa cacctagtgc tgtagctgta agtggttttg taactaaaga tgatactgat 7 80 
ttagcaaaat caggtactat aaatgtaaga gttataaatg caaaagaaga atcaattgat 840 
atagatgcaa gctcatatac atcagctgaa aatttagcta aaagacatgt atttgatcca 900 
gatgaaattt ctgaagcata taaggcaata gtagcattac aaaatgatgg tatagagtct 960 
aatttagttc agttagttaa tggaaaatat caagtgattt tttatccaga aggtaaaaga 
ttagaaacta aatcagcaaa tgatacaata gctagtcaag atacaccagc taaagtagtt 
ataaaagcta ataaattaaa agatttaaaa gattatgtag atgatttaaa aacatataat 
aatacttatt caaatgttgt aacagtagca ggagaagata gaatagaaac tgctatagaa 
ttaagtagta aatattataa ttctgatgat aaaaatgcaa taactgataa agcagttaat 



120 
180 



300 
360 
420 
480 
540 
600 
660 



1020 
1080 
1140 
1200 
1260 
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gatatagtat tagttggatc tacatctata gttgatggtc ttgttgcatc accattagct 1320 

tcagaaaaaa cagctccatt attattaact tcaaaagata aattagattc atcagtaaaa 1380 

tctgaaataa agagagttat gaacttaaag agtgacactg gtataaatac ttctaaaaaa 1440 

gtttatttag ctggtggagt taattctata tctaaagatg tagaaaatga attgaaaaac 1500 

atgggtctta aagttactag attatcagga gaagacagat acgaaacttc tttagcaata 1560 

gctgatgaaa taggtcttga taatgataaa gcatttgtag ttggtggtac tggattagca 1620 

gatgctatga gtatagctcc agttgcttct caacttaaag atggagatgc tactccaata 1680 

gtagttgtag atggaaaagc aaaagaaata agtgatgatg ctaagagttt cttaggaact 1740 

tctgatgttg atataatagg tggaaaaaat agcgtatcta aagagattga agagtcaata 1800 

gatagtgcaa ctggaaaaac tccagataga ataagtggag atgatagaca agcaactaat 1860 
gctgaagttt taaaagaaga tgattatttc acagatggtg aagttgtgaa ttactttgtt 
gcaaaagatg gttctactaa agaagatcaa ttagtagatg ccttagcagc agcaccaata 
gcaggtagat ttaaggagtc tccagctcca atcatactag ctactgatac tttatcttct 
gaccaaaatg tagctgtaag taaagcagtt cctaaagatg gtggaactaa cttagttcaa 

gtaggtaaag gtatagcttc ttcagttata aacaaaatga aagatttatt agatatga 2158 



<210> 9 

<211> 20 

<212> PRT 

<213> Clostridium difficile 

<400> 9 

Asp Lys Thr Lys Val Glu Thr Ala Asp Gin Gly Tyr Thr Val Val Gin 
1 5 10 15 



Ser Lys Tyr Lys 
20 

<210> 10 
<211> 20 
<212> PRT 

<213> Clostridium difficile 
<400> 10 

Ala Thr Thr Gly Thr Gin Gly Tyr Thr Val Val Lys Asn Asp Gly Lys 



1920 
1980 
2040 
2100 



Lys Ala Val Lys 
20 
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SEQ ID No. 1 
(Strain 171500) 

DKTKVETADQGYTVVQSKYK 

SEQ ID No. 2 
(Strain 170324) 

ATTTGTQGYTWKNDGKKAVK 



SEQ ID No. 3 
(Strain 171500 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCACCT 

GTATTTGCAGATGATACAAAAGTTGAAACTGGTGATCAAGGATATACAGTGGTACAAAGCAA 

GTATAAGAAAGCTGTTGAACAATTACAAAAAGGAATATTAGATGGAAGTATAACAGAAATTA 

AAGTTTTCTTTGAGGGAACTTTAGCATCTACTATAAAAGTAGGTTCTGAGCTTAATGCAGCAG 

ATGCAAGTAAATTATTGTTTACACAAGTAGATAATAAACTAGATAATTTAGGTGATGGAGATT 

ATGTAGATTTCTTAATAACTTCTCCAGGTCAAGGGGATAAAATAACTACAAGTAAACTTGTTG 

CATTGAAAGATTTAACAGGTGCTTCAGCAGATGCTATAATTGCTGGAACATCTTCAGCAGATG 

GTGTTGTTACAAATACTGGAGCTGCTAGTGGTTCTACTGAGACAAATTCAGCAGGAACAAAAC 

TTGCAATGTCAGCTATTTTTGACACAGCATATACAGATTCATCTGAAACTGCGGTTAAGATTA 

CTATAAAAGCAGATATGAATGATACTAAATTTGGTAAAGCAGGTGAGACAACTTATTCAACTG 

GGCTTACATTTGAAGATGGGTCTACAGAAAAAATTGTTAAATTAGGGGACAGTGATATTATAG 

ATATAACTAAAGCTCTTAAACTTACTGTTGTTCCTGGAAGTAAAGCAACTGTTAAGTTTGCTG 

AAAAAACACCAAGTGCCAGTGTTCAACCAGTAATAACAAAGCTTAGAATAATAAATGCTAAA 

GAAGAAACAATAGATATTGACGCTAGTTCTAGTAAAACAGCACAAGATTTAGCTAAAAAATA 

TGTAnTAATAAAACTGATTTAAATACTCTTTATAAAGTATTAAATGGAGATGAAGCAGATAC 

TAATGGATTAATAGAAGAAGTTAGTGGAAAATATCAAGTAGTTCTTTATCCAGAAGGAAAAA 

GAGTTACAACTAAGAGTGCTGCAAAGGCTTCAATTGCTGATGAAAATTCACCAGTTAAATTAA 

CTCTTAAGTCAGATAAGAAGAAAGACTTAAAAGATTATGTGGATGATTTAAGAACATATAAT 

AATGGATATTCAAATGCTATAGAAGTAGCAGGAGAAGATAGAATAGAAACTGCAATAGCATT 

AAGTCAAAAATATTATAACTCTGATGATGAAAATGCTATATTTAGAGATTCAGTTGATAATGT 

AGTATTGGTTGGAGGAAATGCAATAGTTGATGGACTTGTAGCTTCTCCTTTAGCTTCTGAAAA 

GAAAGCTCCTTTATTATTAACTTCAAAAGATAAATTAGATTCAAGCGTAAAAGCTGAAATAAA 

GAGAGTTATGAATATAAAGAGTACAACAGGTATAAATACTTCAAAGAAAGTTTATTTAGCTG 

GTGGAGTTAATTCTATATCTAAAGAAGTAGAAAATGAATTAAAAGATATGGGACTTAAAGTT 

ACAAGATTAGCAGGAGATGATAGATATGAAACTTCTCTAAAAATAGCTGATGAAGTAGGTCT 

TGATAATGATAAAGCATTTGTAGTTGGAGGAACAGGATTAGCAGATGCCATGAGTATAGCTCC 

AGTTGCATCTCAATTAAGAAATGCTAATGGTAAAATGGATTTAGCTGATGGTGATGCTACACC 

AATAGTAGTTGTAGATGGAAAAGCTAAAACTATAAATGATGATGTAAAAGATTTCTTAGATG 

ATTCACAAGTTGATATAATAGGTGGAGAAAACAGTGTATCTAAAGATGTTGAAAATGCAATA 

GATGATGCTACAGGTAAATCTCCAGATAGATATAGTGGAGATGATAGACAAGCAACTAATGC 

AAAAGTTATAAAAGAATCTTCTTATTATCAAGATAACTTAAATAATGATAAAAAAGTAGTTAA 

TTTCTTTGTAGCTAAAGATGGTTCTACTAAAGAAGATCAATTAGTTGATGCTTTAGCAGCAGCT 

CCAGTTGCAGCAAACTTTGGTGTAACTCTTAATTCTGATGGTAAGCCAGTAGATAAAGATGGT 

AAAGtATTAACTGGTTCTGATAATGATAAAAATAAATTAGTATCTCCAGCACCTATAGTATTAG 

CTACTGATTCTTTATCTTCAGATCaAAGTGTATCTATAAGTAaAGTTCTTGATAAAGATAATGG 

AGAAAACTTAGTTCAAGTTGGTAAAGGTATAGCTACTTCAGTTATAAACAAAATGAAAGATTT 

ATTAGATATGTAA 
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SEQ ID No. 4 
(Strain 172450 DNA) 

ATGAAAAAAAGAAATTTAGCAATGGCTATGGCAGCTGTTACTGTAGTAGGTTCTGCTGCTCCA 

GTTTTTGCAGCAGCTTCAGATGTAATATCACTACAAGATGGTACAAATGATAAGTATACAGTA 

TCAAATACTAAAGCTAGTGACTTAGTAAAGGATATTTTAGCAGCACAAAACTTAACAACAGGT 

GCAGTTATTTTGAACAAAGATACAAAAGTTACTTTCTATGATGCAAATGAGAAAGATTCTTCA 

ACTCCAACTGGAGATAAAAAAGTTTATTCAGAACAAACTTTAACTACAGCTAATGGAAATGA 

AGATTATGTAAAGACAACTTTAAAAAATTTAGATGCAGGAGAATATGCTATTATAGATTTAAC 

TTATAATAATGCTAAAACTGTTGAAATTAAAGTAGTAGCAGCTAGTGAAAAAACAGTAGTTGT 

ATCTAGTGATGCGAAAAATAGTGCAAAAGATATAGCTGAAAAATATGTGTTTGAAGACAAAG 

ACTTAGAAAATGCACTAAAAACTATAAATGCCTCAGATTTCAGTAAAACTGATAGTTACTATC 

AAGTAGTTCTTTATCCAAAAGGAAAGAGATTACAAGGTTTCTCAACTTATAGAGCTACAAATT 

ATAATGAAGGAACTGCATATGGTAATACACCAGTAATATTAACTCTAAAATCTACTAGTAAGA 

GTAATTTAAAGACTGCAGTAGAAGAGTTACAAAAATTGAATGCTAGTTATTCTAATACTACAA 

CTTTAGCTGGTGATGACAGAATACAAACAGCTATAGAGATAAGTAAAGAATATTACAATAAT 

GATGGCGAGAAATCAGATCATTCAGCTGATGTTAAAGAGAATGTTAAAAATGTTGTATTAGTA 

GGTGCAAATGCACTAGTAGATGGATTAGTTGCGGCTCCTTTAGCAGCAGAAAAAGATGCTCCA 

CTATTATTAACTTCAAAAGATAAATTAGATTCGTCAGTAAAATCTGAAATAAAGAGAGTTTTA 

GACTTAAAAACTTCAACAGAAGTAACAGGAAAAACAGTTTATATAGCTGGTGGAGTTAATAG 

TGTATCTAAAGAAGTTGTAACAGAATTAGAATCAATGGGATTAAAAGTTGAAAGATTCTCAG 

GTGATGATAGATATGAAACTTCTTTAAAAATAGCAGGTGAAATAGGCTTAGATAATGATAAG 

GCTTATGTAGTTGGTGGAACAGGATTAGCAGATGCCATGAGTATAGCTTCAGTTGCTTCTACT 

AAATTAGATGGTAATGGTGTTGTAGATAGAACAAATGGACATGCTACTCCAATAGTTGTTGTA 

GATGGAAAAGCTGATAAAATATCTGATGACTTAGATAGTTTCTTAGGAAGCGCTGATGTAGAT 

ATAATAGGTGGATTTGCAAGTGTATCTGAAAAGATGGAAGAAGCTATATCAGATGCTACTGGT 

AAAGGCGTTACAAGAGTTAAAGGCGACGATAGACAAGACACTAACTCTGAAGTTATAAAAAC 

ATATTATGCTAATGATACTGAAATAGCTAAAGCTGCAGTTTTAGATAAAGATTCAGGTGCTTC 

AAGTAGTGATGCAGGAGTATTTAATTTCTATGTAGCTAAAGATGGATCTACAAAAGAAGATCA 

ATTAGTTGATGCATTAGCAGTAGGAGCTGTTGCTGGATATAAACTTGCTCCAGTTGTATTAGCT 

ACTGATTCTTTATCTTCTGATCAATCGGTTGCTATAAGCAAAGTTGTAGGAGAAAAATATTCTA 

AAGATTTAACACAAGTTGGTCAAGGAATAGCTAATTCAGTTATAAACAAAATGAAAGATTTAT 

TAGATATG 



SEQ ID No. 5 
(Strain 170324 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCTCCT 
GTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAAAAAGCA 
GTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTATCTTTTAA 
TGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGACAGAGATGCTG 
CAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGTGATGGAGATTATG 
TTGATTTTTCTGTAGATTATAATTTAGAAAACAAAATAATAACTAATCAAGCAGATGCAGAAG 
CAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATTGATATAGCAACTAAAGATA 
CTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAAAATGTTGCTGCAACAAAGGCA 
CTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCTGGTGGAAGCGAAGATACTGGATAT 
GTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAGTATGGTAAAGTTGGAGATAGTACGGC 
AGGTATTGCAATAAATCTTCCTAGTACTGGACTTGAATATGCAGGTAAAGGAACAACAATTGA 
TTTTAATAAAACTTTAAAAGTTGATGTAACAGGTGGTTCAACACCTAGTGCTGTAGCTGTAAG 
TGGTTTTGTAACTAAAGATGATACTGATTTAGCAAAATCAGGTACTATAAATGTAAGAGTTAT 
AAATGCAAAAGAAGAATCAATTGATATAGATGCAAGCTCATATACATCAGCTGAAAATTTAG 
CTAAAAGATATGTATTTGATCCAGATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTAC 
AAAATGATGGTATAGAGTCTAACTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTT 
ATCCAGAAGGTAAAAGATTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACA 
CCAGCTAAAGTAGTTATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTA 
AAAACATATAATAATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAAC 
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TGCTATAGAATTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGC 
AGTTAATGATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTA 
GCTTCAGAAAAAACAGCTCCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAA 
TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAAGTT 
TATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAACATGGGT 
CTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAA 
ATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCAGATGCTATGAGT 
ATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 
AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATA 
GGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAAC 
TCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAATGCTGAAGTTTTAAAAGAAGATG 
ATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAG 
ATCAATTAGTAGATGCCTTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTC 
CAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTC 
CTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACA 
AAATGAAAGATTTATTAGATATGG 



SEQ ID No. 6 
(Strain 171448 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCTCCT 

GTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAA AAAAG CA 

GTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTATCTTTTAA 

TGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGACAGAGATGCTG 

CAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGTGATGGAGATTATG 

TTGATTTTTCTGTAGATTATAATTTAGAAAACAAAATAATAACTAATCAAGCAGATGCAGAAG 

CAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATTGATATAGCAACTAAAGATA 

CTTTTGGAATGGTTAGTAAAACACAAGATAGTGGAGGTAAAAATGTTGCTGCAACAAAGGCA 

CTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCTGGTGGAAGCGAAGATACTGGATAT 

GTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAGTATGGTAAAGTTGGAGATAGTACGGC 

AGGTATTGCAATAAATCTTCCTAGTACTGGACTTGAATATGCAGGTAAAGGAACAACAATTGA 

TTTTAATAAAACTTTAAAAGTTGATGTAACAGGTGGTTCAACACCTAGTGCTGTAGCTGTAAG 

TGGTTTTGTAACTAAAGATGATACTGATTTAGCAAAATCAGGTACTATAAATGTAAGAGTTAT 

AAATGCAAAAGAAGAATCAATTGATATAGATGCAAGCTCATATACATCAGCTGAAAATTTAG 

CTAAAAGATATGTATTTGATCCAGATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTAC 

AAAATGATGGTATAGAGTCTAATTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTT 

ATCCAGAAGGTAAAAGATTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACA 

CCAGCTAAAGTAGTTATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTA 

AAAACATATAATAATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAAC 

TGCTATAGAATTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGC 

AGTTAATGATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTA 

GCTTCAGAAAAAACAGCTCCATTATTATTAGCTTCAAAAGATAAATTAGATTCATCAGTAAAA 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAAGTT 

TATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAACATGGGT 

CTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAA 

ATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCAGATGCTATGAGT 

ATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 

AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATA 

GGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAAC 

TCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAATGCTGAAGTTTTAAAAGAAGATG 

ATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAG 

ATCAATTAGTAGATGCCTTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTC 

CAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTC 

CTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACA 

AAATGAAAGATTTATTAGATATGG 
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SEQ ID No. 7 

ATGAATAAGAAAAACTTAGCAATGGCTATGGCAGCAGTTACTG^^ 

ATATTTGCAGATAGTACTACGCCAGGTTATACTGTAGTGAAAAATGATTGGAAAAAAGCAGT 

AAAACAATTACAAGATGGGTTGAAAAATAAAACTATATCAACAATAAAGGTGTCTTTTAATG 

GAAACTCTGTTGGAGAAGTTACACCAGCCAGTTCTGGAGCAAAAAAAGCAGATAGAGATGCT 

GCAGCTGAAAAGTTATATAATTTAGTAAATACACAATTAGATAAACTAGGTGATGGAGATTAC 

GTTGACTTTGAAGTAACTTATAATTTAGCTACTCAAATAATTACAAAAGCAGAAGCAGAGGCA 

GTTCTTACAAAATTACAACAATATAATGATAAAGTACTTATAAATTCTGCAACAGATACAGTA 

AAAGGTATGGTATCTGATACACAAGTTGATAGCAAAAATGTTGCAGCTAACCCACTTAAAGTT 

AGTGATATGTATACAATACCATCTGCTATTACTGGAAGTGATGATTCTGGGTATAGTATTGCT 

AAACCAACAGAAAAGACTACAaGTTTATTGTATGGTACGGTTGGTGATGCAACTGCAGGTAAA 

GCAATAACAGTAGATACAGCTTCAAATGAAGCTTTTGCTGGAAATGGAAAGGTTATTGACTAC 

AATAAATCA1TCAAAGCAACTGTACAAGGAGATGGAACAGTTAAGACAAGCGGGGTTGTACT 

TAAAGATGCAAGTGATATGGCTGCAACAGGTACTATAAAAGTTAGAGTTACAAGTGCAAAAG 

AAGAATCTATTGATGTGGATTCAAGTTCATATATTAGTGCTGAAAATTTAGCTAAAAAATATG 

TATTTAATCCTAAAGAGGTTTCTGAAGCTTATAATGCAATAGTTGCATTACAAAATGATGGAA 

TAGAATCTGATTTAGTACAATTAGTTAATGGAAAATATCAAGTTATTTTCTATCCAGAAGGAA 

AAAGA1TAGAAACTAAATCTGCAGATATAATAGCTGATGCAGATAGTCCAGCTAAAATAACT 

ATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTAAAAACATACAATAA 

TACTTACTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAATTAA 

GTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATGATGCAGTTAATAATATAG 

TATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCTTCAGAAAAAAC 

AGCTCCATrATTATTAACrrCAAAAGATAAATTAGATTCATCAGTAAAATCTGAGATAAAAAG 

AGrrATGAACTTAAAGAGTGATACTGGTATAAATACTTCTAAAAAAGTTTATTTAGCTGGTGG 

AGTTAATTCTATATCTAAAGATGTAGAAGATGAATTGAAAAATATGGGCCTTAAAGTTACTAG 

ATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAAATAGGTCTTGATAA 

TGATAAAGCATTTGTAGTTGGTGGTACTGGATTGGCAGATGCTATGAGTATAGCTCCAGTTGC 

TTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGAAAAGCAAAAGAAA 

TAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATAGGTGGAAAAAATA 

GCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAACTCCAGATAGAATA 

AGTGGAGATGACAGACAAGCAACTAATGCTGAAGnTTAAAAGAAGATGATTATTTCAAAGA 

TGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGA 

TGCATTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTCCAATCATACTAGC 

TACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTCCTAAAGATGGTGG 

AACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACAAAATGAAAGATTT 

ATTAGATATGTAA 

SEQ ID No. 8 

ATGAATAAGAAGGATATAGCAATAGCTATGTC^^ 

GTATTTGCTGCTAGTAGTTTTACAGCAGATTATAATTATACTGTAGTGCAAGGAAAATATCAA 

AAAGTTATAACTGGATTACAAGATGGTTTAAAAAATGGAAAAATAACAAATATTGATGTAAT 

AITTGATGGAAGTTCAATTGGTGAGGTAGTGCCAGGTTCTGATGCTGCAGCTGCAGCTACTAA 

ATTAAAAAGTTTAGTTGATGATAAGTTAGATAACTTAGGTGATGGAAAATACGTTCAATTTAA 

TGTTACTTATACTACTAAATCTATAATAACTAAAGCAGAATTAAAAAATTATTATAATCAATT 

AGAAAGTAGTAAAGATAGAATACTTATAGGAAATGAACCTCAAGATACAGGAACTAAAGGTC 

TTATAAAAGCTGATACTGATGGTACTACTGCTGTTGCAGCAGCTGCACCATTGAAATTATCAG 

ATATATTTACGTTTAGTTATGATGAAGTAACAGGTGTACTTAAAGCAGAACCAACAAGTAAAG 

TAAGCGCTGGTAAAGTTCAAGGTCTAAAATATGGAAATACAGGAGCAACTAACTATACTTCTG 

GAGCTGAAATATCTGTTCCTACTACAGGCTTAACATTAACTGCTGATACAACTGCAACAACAG 

ATGTAAATATTTCTGATGTTATGAGTGCATTTAAATTTAATGGTACTGATACGATTAGTGGATT 

CCCAGCTGGTTCATCAGCTTCTACTCTTAGAGCAAGTATAAAAGTAATAAATGCAAAAGAAGA 

ATCTATAGATGTTGATTCAAGTTCACATAGAACAGCTGAAGATTTAGCTGAAAAATATGTATT 
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TAAACCAGAAGATGTGAATAAAACXrATGAGGCACTGACTGATTTATATAAAGAAGGTATAA 
CAAGTAATCTTATCACTCAAGATGGTGGAAAATATCAAGTTGTnTATTrGCTCAAGGAAAGA 
GATTAACTACTAAAGGAGCAACTGGAACTTTAGCAGATGAAAATTCTCCTCnTAAAGTAACAA 
TAAAAGCAGATAAAGTAAAAGACTTAAAAGATTATGTTGAAGATTTAAAAAATGCTAACAAT 
GGATATTCAAATTCTGTTGTTGTAGCAGGTGAAGATAGAATAGAAACAGCAATAGAGTTAAG 
TAGCAAATACTATAACTCTGATGATGACAATGCAATAACTAAAGATCCAGTTAACAATGTTGT 
TTTAGTTGGTTCTCAAGCTGTAGTTGATGGGCTTGTAGCTTCACCTTTAGCATCTGAAAAAAGA 
GCTCCTTTACTATTAACTTCAGCAGGAAAATTAGATTCAAGTGTTAAAGCTGAGTTGAAAAGA 
GTAATGGATTTAAAATCTACAACAGGTGTAAATACTTCTAAAAAAGTTTACTTAGCTGGTGGA 
GTAAACTCTATATCTAAAGATGTAGAAAATGAATTAAAAGATATGGGACTTAAAGTTACAAG 
ATTATCAGGAGATGATAGATATGAAACTTCTTTAGCTATAGCTGATGAAATAGGTCTTGATAA 
TGATAAAGCTTTTGTAGTTGGAGGAACAGGATTAGCGGATGCTATGAGTATAGCTCCAGTTGC 
TTCTCAATTAAGAAACTCAAATGGAGAACTTGACTTAAAAGGTGATGCAACTCCAATAGTAGT 
TGTTGATGGAAAAGCTAAAGATATAAATTCTGAAGTAAAAGATTTC1TAGATGATTCACAAGT 
TGATATAATAGGTGGTGTAAATAGTGTTTCTAAAGAAGTAATGGAAGCAATAGATGATGCTAC 
TGGAAAATCACCTGAGAGATATAGTGGAGAAGATAGACAAGCAACAAATGCTAAAGTTATAA 
AAGAAGATGATTTCTTTAAAAATGGAGAAGTTACAAACTTCTTTGTAGCTAAAGATGGTTCAA 
CTAAAGAAGATCAATTAGTAGATGCTTTAGCAGGTGCTGCAATTGCTGGTAACTTTGGTGTAA 
CAGTAGATAATGAAGGAAAACCTACAGTTGCTGATAAAAAAGCTTCTCCAGCACCAATTGTTT 
TAGCAACAGATTCTTTATCTTCTGATCAAAATGTAGCTATAAGTAAAGCTGTAAATGATGACG 
CTAATACTAAGAATCTAGTTCAAGTTGGTAAAGGTATAGCTACTTCAGTTGTAAGTAAAATAA 
AAGATTTATTAGATATG 



SEQ ID No. 9 
(Strain 170444 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCTCCT 

GTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAAAAAGCA 

GTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTATCTTTTAA 

TGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGACAGAGATGCTG 

CAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGTGATGGAGATTATG 

TTGATTTTTCTGTAGATTATAATTTAGAAAAAAAAATAATAACTAATCAAGCAGATGCAGAAG 

CAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATTGATATAGCAACTAAAGATA 

CTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAAAATGTTGCTGCAACAAAGGCA 

CTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCTGGTGGAAGCGAAGATACTGGATAT 

GTTATTGAAATGAAAGCAGGAGCTGTAGAGGATAAGTATGGTAAAGTTGGAGATAGTACGGC 

AGGTATTGCAATAAATCTTCCTAGTACTGGACTTGAATATGCAGGTAAAGGAACAACAATTGA 

TTTTAATAAAACTTTAAAAGTTGATGTAACAGGTGGTTCAACACCTAGTGCTGTAGCTGTAAG 

TGGTTTTGTAACTAAAGATGATACTGATTTAGCAAAATCAGGTACTATAAATGTAAGAGTTAT 

AAATGCAAAAGAAGAATCAATTGATATAGATGCAAGCTCATATACATCAGCTGAAAATTTAG 

CTAAAAGACATGTATTTGATCCAGATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTAC 

AAAATGATGGTATAGAGTCTAATTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTT 

ATCCAGAAGGTAAAAGATTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACA 

CCAGCTAAAGTAGTTATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTA 

AAAACATATAATAATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAAC 

TGCTATAGAATTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGC 

AGTTAATGATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTA 

GCTTCAGAAAAAACAGCTCCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAA 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAAGTT 

TATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAACATGGGT 

CTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAA 

ATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCAGATGCTATGAGT 

ATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 

AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATA 

GGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAAC 

TCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAATGCTGAAGTTTTAAAAGAAGATG 
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ATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAG 
ATCAATTAGTAGATGCCTTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTC 
CAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTC 
CTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACA 
AAATGAAAGATTTATTAGATATGA 



SEQ ID No. 10 

(Strain 170426 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCTCCT 

GTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAAAAAGCA 

GTAAAACAATTACAGGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTATCTTTTAA 

TGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGACAGAGATGCTG 

CAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGTGATGGAGATTATG 

TTGATTTTTCTGTAGATTATAATTTAGAAAAAAAAATAATAACTAATCAAGCAGATGCAGAAG 

CAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATTGATATAGCAACTAAAGATA 

CTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAAAATGTTGCTGCAACAAAGGCA 

CTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCTGGTGGAAGCGAAGATACTGGATAT 

GTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAGTATGGTAAAGTTGGAGATAGTACGGC 

AGGTATTGCAATAAATCTTCCTAGTACTGGACTTGAATATGCAGGTAAAGGAACAACAATTGA 

TTTTAATAAAACTTTAAAAGTTGATGTAACAGGTGGTrCAACACCTAGTGCTGTAGCTGTAAG 

TGGTnTGTAACTAAAGATGATACTGATTTAGCAAAATCAGGTACTATAAATGTAAGAGTTAT 

AAATGCAAAAGAAGAATCAATTGATATAGATGCAAGCTCATATACATCAGCTGAAAATTTAG 

CTAAAAGATATGTATTTGATCCAGATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTAC 

AAAATGATGGTATAGAGTCTAATTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTT 

ATCCAGAAGGTAAAAGATTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACA 

CCAGCTAAAGTAGTTATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTA 

AAAACATATAATAATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAAC 

TGCTATAGAATTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGC 

AGTTAATGATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTA 

GCTTCAGAAAAAACAGCTCCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAA 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAAGTT 

TATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAACATGGGT 

CTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAA 

ATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCAGATGCTATGAGT 

ATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 

AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATA 

GGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAAC 

TCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAATGCTGAAGTTTTAAAAGAAGATG 

ATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAG 

ATCAATTAGTAGATGCCTTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTC 

CAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTC 

CTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACA 

AAATGAAAGATTTATTAGATATG 
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2. A Claims l^os.: 

because they relate to parts of the International Appliualiuii that do not comply with the prescribed requirements to such 

an extent that no meaningful International Search can be carried out, speoifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. 1 I Claims Nos.: 

because they are dependent claims and are not drafted in aooordanoa with the saoond and third 



Box II Observations where unity of invention Is lacking (Continuation of Item 2 of first sheet) 



This International Searching Authority found multiple Inventions in this international application, as follows: 

see additional sheet 

1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ' seaichable claims. 

. I I As all searchable claims could be searched without effort Justifying an additional fSe, this Authority did not Invite payment 

this international Seart:h Heport 



. I I As only some of the required additional search fees ware timely paid by the applicant, 
I 1 covers only those claims for which fees were paid, speclfically claims Nos.: 



4. IMo required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

(9. 11. 21. 42, 44) - (1-8. 19. 20, 23-41, 52-54) - partial 

Remark on Protest The additional search fees were accompanied by the applicant's pro 

I I No protest accompanied the payment of additional search fees. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (gi^oups of) 
inventions in this International application, as follows: 

1. Claims: (9, 11, 21, 42, 44) - complete; (1-8, 19. 29, 
23-41, 52-64) - partial 



Clostridium difficile S layer protein (SlpA) comprising SEQ 
ID N0:1 and its corresponding gene (slpA), which comprises 
SEQ ID N0:3; epitopes, horaologs, derivatives, variants or 
fragments thereof. Chimeras comprising any of the previously 
mentioned polynucleotides/ (poly) peptides. Antibodies against 
those (poly) peptides. Vaccines comprising any of the former 
and methods for prophyl axi s /treatment of C. 
difficile-associated diseases based on the use thereof. 



2. Claims: (lO, 13, 14, 17, 18, 22, 43, 46, 47, 50, 
51) - complete: (1-8. 19, 20. 23-41. 
52-64) - partial 



Idem as subject 1, but restricted to a gene comprising SEQ 
ID MOs: 5, 6, 9 or 10 and a polypeptide/peptide comprising 
SEQ ID N0:2. 



3. Claims: (12. 45) - complete; (1-8. 19. 2G, 23-37. 39-41. 
52-54. 56-64) - partial 



Idem as subject 1. but restricted to a gene comprising SEQ 
ID N0:4. 



4. Claims: (15, 48) - complete; (1-8, 19, 20, 23-37, 39-41, 
52-54, 56-64) - partial 



Idem as subject 1. but restricted to a gene comprising SEQ 
ID N0:7. 



5. Claims: (16, 49) - complete; (1-8. 19, 20, 23-37, 39-41. 
52-54, 56-54) - partial 



Idem as subject 1, but restricted to a gene comprising SEQ 
ID N0:8. 



6. Claims: (65) - complete 

Use of inter! eukin 12 as an adjuvant in a C. difficile 
vaccine. 
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7. Claims: (66) - partial 

Use of humanised antibodies for passive vaccination of an 
individual with C. difficile infection. 

8. Claims: (66) - partial 

Use of serum for passive vaccination of an individual with 
C. difficile infection. 
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Continuation of Box 1.2 



Please notice that: 

1. The translations of the ORFs contained in appendices 1-8 are not 
included in the sequence listing and, therefore, the one corresponding to 
the first invention has not been searched. In case the applicant decided 
to pay additional fees, he should be aware of the fact that the same will 
apply to the other inventions.. 

2. Claims 39 and 56 refer to SEQ ID WOs:3-10 as "amino acid sequences" 
but, actually, they are nucleotidic sequences. 

3. The sequence numbering is confusing. The sequence identity numbers 
mentioned in the description and claims do not correspond with those of 
the sequence listing (example: SEQ ID N0:1 of the description is a 
peptide which appears under SEQ ID N0:9 of the sequence listing). 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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